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Preface  to  the  Third  Edition 


The  goal  of  this  new  edition  is  the  same  as  that  for  the  original,  namely,  to 
present  a  one-semester  treatment  of  the  basic  ideas  encountered  in  partial  differ¬ 
ential  equations  (PDEs).  The  text  is  designed  for  a  3-credit  semester  course  for 
undergraduate  students  in  mathematics,  science,  and  engineering.  The  prereq¬ 
uisites  are  calculus  and  ordinary  differential  equations.  The  text  is  intimately 
tied  to  applications  in  heat  conduction,  wave  motion,  biological  systems,  and 
a  variety  other  topics  in  pure  and  applied  science.  Therefore,  students  should 
have  some  interest,  or  experience,  in  basic  science  or  engineering. 

The  main  part  of  the  text  is  the  first  four  chapters,  which  cover  the  essential 
concepts.  Specifically,  they  treat  first-  and  second-order  equations  on  bounded 
and  unbounded  domains  and  include  transform  methods  (Laplace  and  Fourier), 
characteristic  methods,  and  eigenfunction  expansions  (separation  of  variables); 
there  is  considerable  material  on  the  origin  of  PDEs  in  the  natural  sciences 
and  engineering.  Two  additional  chapters,  Chapter  5  and  Chapter  6,  are  short 
introductions  to  applications  of  PDEs  in  biology  and  to  numerical  computation 
of  solutions.  The  text  offers  flexibility  to  instructors  who,  for  example,  may  want 
to  insert  topics  from  biology  or  numerical  methods  at  any  time  in  the  course.  A 
brief  appendix  reviews  techniques  from  ordinary  differential  equations.  Sections 
marked  with  an  asterisk  (*)  may  safely  be  omitted.  The  mathematical  ideas 
are  strongly  motivated  by  physical  problems,  and  the  exposition  is  presented  in 
a  concise  style  accessible  to  students  in  science  and  engineering.  The  emphasis 
is  on  motivation,  methods,  concepts,  and  interpretation  rather  than  formal 
theory. 

The  level  of  exposition  is  slightly  higher  than  students  encounter  in  the 
post-calculus  differential  equations  course.  The  philosophy  is  that  a  student 
should  progress  in  the  ability  to  read  mathematics.  Elementary  texts  contain 
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many  examples  and  detailed  calculations,  but  advanced  mathematics  and  sci¬ 
ence  books  leave  a  lot  to  the  reader.  This  text  leaves  some  of  the  easy  details 
to  the  reader.  Often,  the  arguments  are  derivations  in  lieu  of  carefully  con¬ 
structed  proofs.  The  exercises  are  at  varying  levels  and  encourage  students  to 
think  about  the  concepts  and  derivations  rather  than  just  grind  out  lots  of  rou¬ 
tine  solutions.  A  student  who  reads  this  book  carefully  and  who  solves  many 
of  the  exercises  will  have  a  sound  knowledge  base  to  continue  with  a  second- 
year  partial  differential  equations  course  where  careful  proofs  are  constructed 
or  with  upper-division  courses  in  science  and  engineering  where  detailed,  and 
often  difficult,  applications  of  partial  differential  equations  are  introduced. 

This  third  edition,  a  substantial  revision,  contains  many  new  and  revised 
exercises,  and  some  sections  have  been  greatly  expanded  with  more  worked 
examples  and  additional  explanatory  material.  A  new,  less  dense,  format  makes 
key  results  more  apparent  and  the  text  easier  to  read  for  undergraduates.  The 
result  is  a  text  one-third  longer.  But  the  size  and  brevity  of  text,  contrary  to 
voluminous  other  texts,  struck  a  chord  with  many  users  and  that  has  been 
maintained.  Many  users  provided  suggestions  that  have  become  part  of  this 
revision,  and  I  greatly  appreciate  their  interest  and  comments. 

Elizabeth  Loew,  my  editor  at  Springer,  deserves  special  recognition  for  her 
continuous  and  expert  support.  I  have  found  Springer  to  be  an  extraordinary 
partner  in  this  project. 

Finally,  this  book  is  very  affectionately  dedicated  to  my  two  sons  and  daugh¬ 
ter,  Aaron,  David,  and  Rachel,  who  have  often  been  my  teachers  with  their 
challenging  and  unique  perspectives  on  life.  For  these  gifts  I  greatly  thank  you. 

I  welcome  suggestions,  comments,  and  corrections.  Contact  information  is 
on  my  web  site:  http://www.math.unl.edu/~jloganl,  where  additional  items 
can  be  found.  Solutions  to  some  of  the  exercises  can  be  found  on  the  Springer 
web  site. 

J.  David  Logan 
Willa  Cather  Professor 
Lincoln,  Nebraska 


To  Students 


Our  understanding  of  the  fundamental  processes  of  the  natural  world  is 
based  to  a  large  extent  on  partial  differential  equations.  W.  A.  Strauss 


Partial  differential  equations  (PDEs)  is  a  topic  worthy  of  your  study.  It 
is  a  subject  about  differential  equations  involving  unknown  functions  of  sev¬ 
eral  variables;  the  derivatives  are  partial  derivatives.  As  such,  it  is  a  subject 
that  is  intimately  connected  with  multivariable  calculus.  To  be  successful  you 
should  have  a  good  command  of  the  concepts  in  the  calculus  of  several  vari¬ 
ables.  So  keep  a  calculus  text  nearby  and  review  concepts  when  needed.  The 
same  comments  apply  to  elementary  ordinary  differential  equations  (ODEs). 
An  appendix  at  the  end  of  the  book  reviews  basic  solution  techniques  for  ODEs. 
If  you  wish  to  consult  other  sources,  the  texts  by  Farlow  (1993)  and  Strauss 
(1994)  are  good  choices. 

A  mathematics  book  must  be  read  with  a  pencil  and  paper  in  hand.  Ele¬ 
mentary  books  fill  in  most  steps  in  the  exposition,  but  more  advanced  books 
leave  many  details  to  the  reader.  This  book  has  enough  detail  so  that  you  can 
follow  the  discussion,  but  pencil  and  paper  work  is  required  in  some  portions. 
Verifying  all  the  statements  and  derivations  in  a  text  is  a  worthwhile  endeavor 
and  will  help  you  learn  the  material.  Many  students  find  that  studying  PDEs 
provides  an  opportunity  to  hone  their  skills  and  reinforce  concepts  in  calculus 
and  differential  equations.  Further,  studying  PDEs  increases  your  understand¬ 
ing  of  physical  principles  in  a  monumental  way. 

The  exercises  are  the  most  important  part  of  this  text,  and  you  should  try 
to  solve  most  of  them.  Some  require  routine  analytical  calculations,  but  others 
require  careful  thought.  We  learn  mathematics  by  doing  mathematics,  even 
when  we  are  stymied  by  a  problem.  The  effort  put  into  a  failed  attempt  will 
help  you  sort  out  the  concepts  and  reinforce  the  learning  process.  View  the 
exercises  as  a  challenge  and  resist  the  temptation  to  give  up.  It  is  also  a  good 
habit  to  write  up  your  solutions  in  a  clear,  concise,  logical  form.  Good  writing 
entails  good  thinking,  and  conversely. 


1 

The  Physical  Origins  of  Partial 

Differential  Equations 


Many  important  ideas  in  mathematics  are  developed  within  the  framework 
of  physical  science,  and  mathematical  equations,  especially  partial  differential 
equations,  provides  the  language  to  formulate  these  ideas.  In  reverse,  advances 
in  mathematics  provides  the  stimulus  for  new  advancements  in  science.  Over  the 
years  mathematicians  and  scientists  extended  these  methodologies  to  include 
nearly  all  areas  of  science  and  technology,  and  a  paradigm  emerged  called  math¬ 
ematical  modeling.  A  mathematical  model  is  an  equation,  or  set  of  equations, 
whose  solution  describes  the  physical  behavior  of  the  related  physical  system. 
In  this  context  we  say,  for  example,  that  Maxwell’s  equations  form  a  model 
for  electromagnetic  phenomena.  Like  most  mathematical  models,  Maxwell’s 
equations  are  based  on  physical  observations.  But  the  model  is  so  accurate, 
we  regard  the  model  itself  as  describing  an  actual  physical  law.  Other  mod¬ 
els,  for  example  a  model  of  how  a  disease  spreads  in  a  population,  are  more 
conceptual.  Such  models  often  explain  observations,  but  only  in  a  highly  lim¬ 
ited  sense.  In  general,  a  mathematical  model  is  a  simplified  description,  or 
caricature,  of  reality  expressed  in  mathematical  terms.  Mathematical  modeling 
involves  observation,  selection  of  relevant  physical  variables,  formulation  of  the 
equations,  analysis  of  the  equations  and  simulation,  and,  finally,  validation  of 
the  model  to  ascertain  whether  indeed  it  is  predictive.  The  subject  of  partial 
differential  equations  encompasses  all  types  of  models,  from  physical  laws  like 
Maxwell’s  equations  in  electrodynamics,  to  conceptual  laws  that  describe  the 
spread  of  an  plant  invasive  species  on  a  savanna. 
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1.  The  Physical  Origins  of  Partial  Differential  Equations 


1.1  PDE  Models 


In  this  book  we  examine  models  that  can  be  described  by  partial  differential 
equations.  The  focus  is  on  the  origin  of  such  models  and  tools  used  for  their 
analysis.  Of  particular  interest  are  models  in  diffusion  and  heat  flow,  wave  prop¬ 
agation,  and  transport  of  energy,  chemicals,  and  other  matter.  It  is  impossible 
to  overestimate  the  role  and  importance  of  PDEs  in  science  and  engineering. 

Readers  should  be  familiar  with  systems  governed  by  ordinary  differential 
equations  (ODEs).  For  example,  a  typical  ODE  model  in  population  ecology  is 
the  logistic  model 


du 

dt 


=  ru  (1  — — 


u 

K 


t  >  0, 


which  is  a  simple  equation  for  population  growth  where  the  per  capita  rate 
of  change  of  population,  u'(t)/u(t),  is  a  decreasing  function  of  the  population. 
Here  t  is  time,  and  u  =  u{t)  is  the  population  of  a  given  system  of  individuals. 
We  refer  to  u  as  the  state  and  say  that  the  evolution  of  the  state  variable  is 
governed  by  the  model  equation.  The  positive  numbers  r  and  K  are  given  phys¬ 
ical  parameters  that  represent  the  relative  growth  rate  and  carrying  capacity, 
respectively;  presumably,  r  and  K  can  be  measured  for  the  population  under 
investigation.  The  solution  to  the  logistic  equation  is  easily  found  by  separation 
of  variables  to  be 

UqK 


u(t)  = 


t  >  0, 


+  {K  -  uo)e~rt' 

where  ^(0)  =  uq  is  the  initial  population.  The  logistic  model  accurately 
describes  some  populations  having  a  sigmoid  growth  shape.  In  general,  an  ODE 
model  has  the  form 


du 

dt 


F(t,u;ri,...,rn),  t  >  0, 


where  F  is  a  given  functional  relation  between  t,  u ,  and  m  parameters 
ri, . . . ,  rm.  Often  the  model  includes  an  initial  condition  of  the  form  u( 0)  =  uo, 
where  uq  is  a  given  state  value  at  t  =  0.  More  generally,  an  ODE  model  may 
consist  of  a  system  of  n  ODEs  for  n  state  variables  u\(t), . . . ,  un(t). 

A  PDE  model  differs  from  an  ODE  model  in  that  the  state  variable  u 
depends  on  more  than  one  independent  variable.  ODEs  govern  the  evolution  of 
a  system  in  time,  and  observations  are  made  in  time.  PDEs  model  the  evolution 
of  a  system  in  both  time  and  space;  the  system  can  be  observed  both  in  a  time 
interval  and  in  a  spatial  region  (which  may  be  one-,  two-,  or  three-dimensional). 
PDE  models  may  also  be  independent  of  time,  but  depend  on  several  spatial 
variables.  Two  examples  of  PDEs  are 


utt{x,t)  —  c2uxx(x,t)  =  0,  (wave  equation) 
Uxx{x-,  V )  +  uyy(x,  y)  =  0.  (Laplace’s  equation) 
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The  wave  equation  describes  the  propagation  of  waves  in  a  one  dimensional 
medium.  The  unknown  function  u  =  u{x,  t)  is  a  function  of  position  x  and  time 
t.  In  Laplace’s  equation,  the  unknown  state  is  a  function  u  =  u(x,y),  where  x 
and  y  are  spatial  variables.  It  models,  for  example,  equilibrium  temperatures 
in  a  two-dimensional  region  of  the  plane  with  prescribed  temperatures  on  its 
boundary. 


Example  1.1 

(Heat  flow)  Consider  the  problem  of  determining  the  temperature  in  a  thin, 
laterally  insulated,  cylindrical,  metal  bar  of  length  l  and  unit  cross-sectional 
area,  whose  two  ends  are  maintained  at  a  constant  zero  degrees,  and  whose 
temperature  initially  (at  time  zero)  varies  along  the  bar  and  is  given  by  a  fixed 
function  <f>(x).  See  Figure  1.1. 


14  =  0 


0  x  lx 

Figure  1.1  A  laterally  insulated  metal  bar  with  zero  temperature  at  both 
ends.  Heat  flows  in  the  axial,  or  ^-direction,  and  u(x,  t)  is  the  temperature  of 
the  cross-section  at  x  at  time  t.  At  time  t  =  0  the  temperature  at  locations  x 
is  given  by  4>{x) 


How  does  the  bar  cool  down?  In  this  case,  the  state  variable  u  is  the  tempera¬ 
ture,  and  it  depends  upon  both  when  the  measurement  is  taken  and  where  in 
the  bar  it  is  taken.  Thus,  u  =  u(x,  £),  where  t  is  time  and  0  <  x  <  l.  The  equa¬ 
tion  governing  the  evolution  of  the  temperature  u  is  called  the  heat  equation 
(we  derive  it  in  Section  1.3),  and  it  has  the  form 

Uf  —  kuxx •  (l*-0 

Observe  that  the  subscript  notation  is  used  to  indicate  partial  differentiation, 
and  we  rarely  write  the  independent  variables,  preferring  u  to  u(x,  t ).  The  equa¬ 
tion  states  that  the  partial  derivative  of  the  temperature  with  respect  to  t  must 
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equal  the  second  partial  derivative  of  the  temperature  with  respect  to  x,  multi¬ 
plied  by  a  constant  k.  The  constant  /c,  called  the  diffusivity,  is  a  known  param¬ 
eter  and  a  property  of  the  bar;  it  can  be  determined  in  terms  of  the  density, 
specific  heat,  and  thermal  conductivity  of  the  metal.  Values  for  these  physical 
constants  for  different  materials  can  be  found  in  handbooks  or  online.  Later  we 
observe  that  (1.1)  comes  from  a  basic  physical  law  (energy  conservation)  and 
an  empirical  observation  (Fourier’s  heat  conduction  law).  The  conditions  that 
the  end  faces  of  the  bar  are  maintained  at  zero  degrees  can  be  expressed  by  the 
equations 

u(0,£)  =  0,  u(l,t)  =  0,  t  >  0,  (1.2) 

which  are  called  boundary  conditions  because  they  impose  conditions  on  the 
temperature  at  the  boundary  of  the  spatial  domain.  The  stipulation  that  the 
bar  initially  has  a  fixed  temperature  (j){x)  degrees  across  its  length  is  expressed 
mathematically  by 

u(x,  0)  =  </>(#),  0  <  x  <  l.  (1.3) 

This  condition  is  called  an  initial  condition  because  it  specifies  the  state 
variable  at  time  t  —  0.  The  entire  set  of  equations  (1.1) — (1.3) — the  PDE  and 
the  auxiliary  conditions — form  the  mathematical  model  for  heat  flow  in  the 
bar.  Such  a  model  in  the  subject  of  PDEs  is  called  an  initial  boundary  value 
problem.  The  invention  and  analysis  of  such  models  are  the  subjects  of  this 
book.  □ 

In  this  heat  flow  model,  the  state  variable  u ,  the  temperature,  depends  upon 
two  independent  variables,  a  time  variable  t  and  a  spatial  variable  x.  Such  a 
model  is  an  evolution  model.  Some  physical  systems  do  not  depend  upon 
time,  but  rather  only  upon  spatial  variables.  Such  models  are  called  steady 
state  or  equilibrium  models.  For  example,  if  17  is  a  bounded,  two-dimensional 
spatial  domain  representing  a  planar,  laminar  plate,  and  on  the  boundary  of 
17,  denoted  by  Sl7,  there  is  imposed  a  given,  time-independent  temperature, 
then  the  steady-state  temperature  distribution  u  =  u(x,  y )  inside  17  satisfies 
the  Laplace  equation,  a  partial  differential  equation  having  the  form 

^ xx  T  ^ yy  0?  (-L  V )  F  17.  (1*4) 

If  we  denote  the  fixed  boundary  temperature  by  f(x,y),  then  (1.4)  along  with 
the  boundary  condition 


u{x,  y)  =  f(x,  y),  (x,y)€dfi,  (1.5) 

is  an  equilibrium  model  for  temperatures  in  the  plate.  In  PDEs  these  spatial 
models  are  called  boundary  value  problems.  Solving  Laplace’s  equation 
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(1.4)  in  a  region  ft  subject  to  a  given  condition  (1.5)  on  the  boundary  is  a 
famous  problem  called  the  Dirichlet  problem. 

In  general,  a  second-order  evolution  PDE  in  one  spatial  variable  and  time 
is  an  equation  of  the  form 

G (x,  t,  it,  uXl  Uxx')  'U'tti  'U'xt)  —  0,  x  £  /,  t  0,  (1.6) 

where  I  is  a  given  spatial  interval,  which  may  be  a  bounded  or  unbounded. 
The  equation  involves  an  unknown  function  u  =  u(x,  £),  the  state  variable,  and 
some  of  its  partial  derivatives.  The  order  of  a  PDE  equation  is  the  order  of 
the  highest  derivative  that  occurs.  The  PDE  is  almost  always  supplemented 
with  initial  and/or  boundary  conditions  that  specify  the  state  u  at  time  t  =  0 
and  on  the  boundary.  One  or  more  parameters,  which  are  not  explicitly  shown, 
may  also  occur  in  (1.6). 

PDEs  are  classified  according  to  their  order  and  other  properties.  For  exam¬ 
ple,  as  is  the  case  for  ODEs,  they  are  classified  as  linear  or  nonlinear.  Equation 
(1.6)  is  linear  if  G  is  a  linear  function  in  u  and  in  all  of  its  derivatives;  how 
the  independent  variables  x  and  t  appear  is  not  relevant.  This  means  that  the 
unknown  u  and  its  derivatives  appear  alone  and  to  the  the  first  power.  Other¬ 
wise,  the  PDE  is  nonlinear.  A  linear  equation  is  homogeneous  if  every  term 
contains  u  or  some  derivative  of  u.  It  is  nonhomogeneous  if  there  is  a  term 
depending  only  on  the  independent  variables,  t  and  x. 

Example  1.2 

Both  second-order  equations 

ut  +  uuxx  =  0  and  utt  —  ux  +  sin  u  =  0 

are  nonlinear,  the  first  because  of  the  product  uuxx  and  the  second  because  the 
unknown  u  is  tied  up  in  the  nonlinear  sine  function.  The  second-order  equation 

Ut  —  sin  (x2t)uxt  =  0 

is  linear  and  homogeneous,  and  the  equation 

Ut  +  3  xuxx  =  tx 2 

is  linear  and  nonhomogeneous.  □ 

In  many  discussions  it  is  convenient  to  introduce  operator  notation.  For 
example,  we  can  write  the  heat  equation 


Ut  —  kuxx  =  0 
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as 


Lu  =  0  where  L  =  — 

dt 


k 


d 2 

dx 2 


Here  L  is  a  differential  operator,  and  we  write  its  action  on  a  function  u  as 
as  either  Lu  or  L(u).  It  acts  on  twice  continuously  differentiable  functions 
u  =  u(x,  t )  to  produce  a  new  function.  We  say  a  differential  operator  L  is 
linear  if,  and  only  if,  it  satisfies  the  two  conditions 


L(u  +  v)  =  Lu  +  Lv,  L{cu)  =  cLu 

for  all  functions  u  and  v,  and  all  constants  c.  If  L  is  a  linear,  then  the  equation 
Lu  =  0  is  said  to  be  homogeneous,  and  the  equation  Lu  =  /  is  nonhomo- 

geneous. 

One  cannot  overstate  the  significance  of  the  partition  of  PDEs  into  the  two 
categories  of  linear  and  nonlinear.  Linear  equations  have  algebraic  structure  to 
their  solution  sets:  the  sum  of  two  solutions  to  a  homogeneous  linear  equation  is 
again  a  solution,  as  are  constant  multiples  of  solutions.  Another  way  of  saying 
this  is  that  solutions  superimpose.  Thus,  if  rq,  rq,  . . . ,  un  are  solutions  to 
Lu  =  0,  and  c\,  C2r  . .  ,cn  are  constants,  then  the  linear  combination 


c\U\  +  c2U2  H - h  cnun 

is  also  a  solution  to  Lu  =  0.  As  we  see  later,  this  superposition  principle 
extends  in  many  cases  to  infinite  sums  and  even  to  a  continuum  of  solutions. 
For  example,  if  u(x,  t,  £)  is  a  one-parameter  family  of  solutions  to  Lu  =  0,  for 
all  £  in  an  interval  J,  then  we  can  often  prove 

J  c(£)u(x,t,£)d£ 

is  a  solution  to  Lu  =  0  for  special  conditions  on  the  distributed  ‘constants’ 
(i.e. ,  the  function)  c(£).  These  superposition  principles  are  essential  in  this 
text.  Every  concept  we  use  involves  superposition  in  one  way  or  another. 

Another  result  based  on  linearity  is  that  the  real  and  imaginary  parts  of 
a  complex-valued  solution  re  to  a  homogeneous  differential  equation  Lw  =  0 
are  both  real  solutions.  Specifically,  if  w  is  complex-valued  function,  then  w  = 
u  +  in,  where  u  =  Re  re  and  v  =  Imre  are  real- valued  functions.  Then,  by 
linearity, 

Lw  =  L(u  +  iv)  =  Lu  +  iLv  =  0. 

This  implies  Lu  =  0  and  Lv  =  0,  because  if  a  complex  function  is  indent ically 
zero  then  both  its  real  and  imaginary  parts  are  zero. 

Nonlinear  equations  do  not  share  these  properties.  Nonlinear  equations  are 
harder  to  solve,  and  their  solutions  are  more  difficult  to  analyze.  Even  when 
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nature  presents  us  with  a  nonlinear  model,  we  often  approximate  it  with  a  more 
manageable  linear  one. 

Equally  important  in  classifying  PDEs  is  the  specific  nature  of  the  physi¬ 
cal  phenomena  that  they  describe.  For  example,  a  PDEs  can  be  classified  as 
wave-like ,  diffusion-like ,  or  equilibrium ,  depending  on  whether  it  models  wave 
propagation,  a  diffusion  process,  or  an  equilibrium  state.  For  example,  Laplace’s 
equation  (1.4)  is  a  second-order,  linear  equilibrium  equation;  the  heat  equation 
(1.1)  is  a  second-order,  linear  diffusion  equation  because  heat  flow  is  a  diffusion 
process.  In  the  last  section  of  this  chapter  we  give  a  more  precise,  mathematical 
characterization  of  these  properties. 

By  a  solution  to  the  PDE  (1.6)  we  mean  a  function  u  =  u(x,t)  defined 
on  the  space-time  domain  t  >  0,  x  G  /,  that  satisfies,  upon  substitution, 
the  equation  (1.6)  identically  on  that  domain.  Implicit  in  this  definition  is  the 
stipulation  that  u  possess  as  many  continuous  partial  derivatives  as  required  by 
the  PDE.  For  example,  a  solution  to  a  second-order  equation  should  have  two 
continuous  partial  derivatives  so  that  it  makes  sense  to  calculate  the  derivatives 
and  substitute  them  into  the  equation.  Whereas  the  general  solution  to  an  ODE 
involves  arbitrary  constants,  the  general  solution  to  a  PDE  involves  arbitrary 
functions.  Sometimes  the  general  solution  to  a  PDE  can  be  found,  but  it  is 
usually  not  necessary  to  have  it  to  solve  most  problems  of  interest. 

Example  1.3 

One  should  check,  by  direct  substitution,  that  both  functions 

ui(x,  t)  =  x2  +  2t  and  U2(x,  t)  =  e~l  sin  a? 
are  solutions  to  the  heat  equation 

u+  —  uxx  =  0. 

There  are  many  other  solutions  to  this  equation.  Auxiliary  conditions,  like 
initial  and  boundary  conditions,  generally  single  out  the  appropriate  solution 
to  a  problem.  □ 

Example  1.4 

Consider  the  first-order,  linear,  nonhomogeneous  PDE 

ux  =  t  sin  ax 

This  equation  can  be  solved  by  direct  integration.  We  integrate  with  respect  to 
ay  holding  t  fixed,  to  get 


u(x,  t )  =  —tcosx  +  ^(t), 
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where  ip  is  an  arbitrary  function  of  t.  In  PDEs,  integration  with  respect  to  one 
variable  produces  an  arbitrary  function  of  the  other  variable,  not  an  arbitrary 
constant  as  in  one-dimensional  calculus.  This  last  equation  defines  the  general 
solution.  One  can  check  that  it  is  a  solution  for  any  differentiable  function 
t ).  Usually,  PDEs  have  arbitrary  functions  in  the  expression  for  their  general 
solutions;  the  number  of  such  functions  often  agrees  with  the  order  of  the 
equation.  □ 

Example  1.5 

The  second-order  PDE  for  u  =  u(x,  t), 

Utt  —  4  a  =  0 

is  just  an  like  an  ODE  with  x  as  a  parameter.  So  the  ‘constants’  depend  on  x. 
The  solution  is 

u(x,t)  =  (j){x)e~2t  +  x)e2t , 
where  f>  and  ip  are  arbitrary  functions  of  x.  □ 


u  A 


x 

Figure  1.2  A  solution  surface  u  =  u(x,t).  A  cross-section  u(x,to)  of  the 
surface  at  time  to  is  interpreted  as  a  wave  profile  at  t  =  to 


Geometrically,  a  solution  u  =  u(x,  t)  can  be  thought  of  as  a  surface  in  xtu- 
space.  Refer  to  Figure  1.2.  The  surface  lies  over  the  space-time  domain:  x  E  /, 
t  >  0.  Alternately,  one  could  regard  the  solution  as  a  continuous  sequence  of 
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time  snapshots.  That  is,  for  each  fixed  time  to,  u(xfio)  is  a  function  of  x  alone 
and  thus  represents  a  time  snapshot  of  the  solution.  In  different  words,  u(x,  to) 
is  the  trace  of  the  solution  surface  u  =  u(x,  t)  taken  in  the  t  =  to-plane.  In 
some  contexts,  u(xfio)  is  interpreted  as  a  wave  profile ,  or  signal,  at  time  to-  In 
this  way  a  solution  u(xfi)  of  (1.6)  can  be  regarded  a  continuous  sequence  of 
evolving  wave  forms  evolving  in  time. 

Bibliographic  Notes.  There  are  dozens  of  excellent  elementary  PDE  books 
written  at  about  the  same  level  as  this  one.  We  especially  mention  Farlow 
(1993)  and  Strauss  (1992).  A  more  advanced  treatment  is  given  by  McOwen 
(2003).  Nonlinear  PDEs  at  the  beginning  level  are  treated  in  detail  in  Debnath 
(1997)  or  Logan  (2008).  PDE  models  occur  in  every  area  of  the  pure  and  applied 
sciences.  General  texts  involving  modeling  in  engineering  and  science  are  Lin 
&  Segel  (1989),  Holmes  (2011),  and  Logan  (2013). 


EXERCISES 


1.  Verify  that  a  solution  to  the  heat  equation  (1.1)  on  the  domain  —  oo  <  x  < 
oo,  t  >  0  is  given  by 


u(x,  t ) 


1 


V47r kt 


-x2 /4kt 


For  a  fixed  time,  the  reader  should  recognize  this  solution  as  a  bell-shaped 
curve,  (a)  Pick  k  =  0.5.  Use  software  to  sketch  several  time  snapshots  on 
the  same  set  of  coordinate  axes  to  show  how  the  temperature  profile  evolves 
in  time,  (b)  What  do  the  temperature  profiles  look  like  as  t  0?  (c)  Sketch 
the  solution  surface  u  =  u(pc,t)  in  a  domain  —2  <  x  <  2,  0.1  <  t  <  4.  (d) 
How  does  changing  the  parameter  k  affect  the  solution? 

2.  Verify  that  u(x,y)  =  In  x2  +  y2  satisfies  the  Laplace  equation 


^ xx  T  'Uyy  —  0 


for  all  (x,  y)  ^  (0,  0). 

3.  Find  the  general  solution  of  the  equation  uxy{x,y)  =  0  in  terms  of  two 
arbitrary  functions. 

4.  Derive  the  solution  u  =  u(x,  y )  =  axy  +  bx  +  cy  +  d  (a,  5,  c,  d  constants), 
of  the  PDE 

yfi  4-  yfi  —  Q 
^ xx  1  yy 

Observe  that  the  solution  does  not  explicitly  contain  arbitrary  functions. 


10 


1.  The  Physical  Origins  of  Partial  Differential  Equations 


5.  Find  a  function  u  =  u{pc,t)  that  satisfies  the  PDE 

uxx  =  0,  0  <  x  <  1,  t  >  0, 
subject  to  the  boundary  conditions 

'u(0,t)  =  t2,  u(l,t)  =  l,  t  >  0. 


6.  Verify  that 


u(x,  t) 


1 

2c 


'X-\-ct 


g(s)ds 


x—ct 


is  a  solution  to  the  wave  equation  uu  =  c2uxx ,  where  c  is  a  constant  and  g  is 
a  given  continuously  differentiable  function.  Hint:  Here  you  will  need  to  use 
Leibniz’s  rule  for  differentiating  an  integral  with  respect  to  a  parameter 
that  occurs  in  the  limits  of  integration: 


d  fbW 

f-  /  F(s)ds  =  F(b(t))b'(t)  -  F(a(t))a'(t) 
dt  Ja(t ) 


7. 


For  what  values  of  a  and  b  is  the  function  u(x,  t)  =  eat  sin  bx  a  solution  to 
the  heat  equation 


ut 


ku 


nr*  nr*  • 


8.  Find  the  general  solution  to  the  equation  uxt  +  3 ux  =  1.  Hint:  Let  v  —  ux 
and  solve  the  resulting  equation  for  v\  then  find  u. 

9.  Show  that  the  nonlinear  equation  ut  =  u2  +uxx  can  be  reduced  to  the  heat 
equation  (1.1)  by  changing  the  dependent  variable  to  w  =  eu . 

10.  Show  that  the  function  u(x,  y )  =  arctan (y/x)  satisfies  the  two-dimensional 
Laplace’s  equation  u 

oc  oc  -j-  Uyy  —  0. 

11.  Show  that  e_^ysin(<^x),  x  G  R,  y  >  0,  is  a  solution  to  uxx  +  uyy  =  0  for 
any  value  of  the  parameter  Deduce  that 

nOO 

u(x,y)=  c({)e~fy  sin(£x)d£ 

Jo 

is  a  solution  to  the  same  equation  for  any  function  c(£)  that  is  bounded 
and  continuous  on  [0,  oc).  Hint:  The  hypotheses  on  c  allow  you  to  bring 
a  derivative  under  the  integral  sign.  [This  exercise  shows  that  taking  inte¬ 
grals  of  solutions  sometimes  gives  another  solution;  integration  is  a  way  of 
superimposing,  or  adding,  a  continuum  of  solutions.] 
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12.  Linear,  homogeneous  PDEs  with  constant  coefficients  admit  complex  solu¬ 
tions  of  the  form 

u(x,t)  =  Aeiikx-ut\ 

which  are  called  plane  waves.  The  real  and  imaginary  parts  of  this  com¬ 
plex  function, 

Re(T)  =  Acos(kx  —  cet),  Im('u)  =  Asin(kx  —  out), 

give  real  solutions.  The  constant  A  is  the  amplitude,  k  is  the  wave  num¬ 
ber,  and  uj  is  the  temporal  frequency.  When  the  plane  wave  form  is 
substituted  into  a  PDE  there  results  a  dispersion  relation  of  the  form 

uj  =  o;(fc), 

which  states  how  the  frequency  depends  upon  the  wave  number.  For  the 
following  PDEs  find  the  dispersion  relation  and  determine  the  resulting 
plane  wave;  sketch  wave  profiles  at  different  times. 

a)  Ut  =  Duxx. 

b)  Utt  —  c  uxx . 

c)  Uf  T  uxxx  —  0. 

d)  ut  =  iuxx.  (Here,  i  is  the  complex  number  i2  =  —1.) 

e)  ut  +  cux  =  0. 

13.  Second-order  linear  homogeneous  equations  with  constant  coefficients  are 
often  classified  by  their  dispersion  relation  uj  =  uu(k)  (see  Exercise  12). 
If  uj{k)  is  complex,  the  PDE  is  called  diffusive,  and  if  uu(k)  is  real  and 
uj" (k)  7^  0,  the  PDE  is  called  dispersive.  The  diffusion  equation  is  dif¬ 
fusive;  the  wave  equation  is  neither  diffusive  or  dispersive.  The  term  dis¬ 
persive  means  that  the  speed  u(k)/k  of  a  plane  wave  u  =  Ael(kx~UJ(kW 
travels  depends  upon  the  wave  number  k.  So  waves  of  different  wavelength 
travel  at  different  speeds,  and  thus  disperse.  Classify  the  PDEs  in  (a)-(e) 
of  Exercise  12  according  to  this  scheme. 

14.  Find  plane  wave  solutions  to  the  Kuromoto-Sivashinsky  equation 

ut  =  —u  —  Suxx  —  uxxxx,  S  >  0. 

Find  the  dispersion  relation  and  classify  the  equation  according  to  the 
scheme  of  the  preceding  exercise.  Describe  the  solutions  and  plot  S  as  a 
function  of  the  wave  number  k  to  determine  when  the  growth  rate  of  a 
solution  is  zero.  For  which  wave  numbers  will  the  solution  decay? 
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1.2  Conservation  Laws 

Many  PDEs  come  from  a  basic  balance,  or  conservation  law.  A  conservation 
law  is  a  mathematical  formulation  of  the  fact  that  the  rate  at  which  a  quantity 
changes  in  a  given  domain  must  equal  the  rate  at  which  the  quantity  flows 
across  the  boundary  (in  minus  out)  plus  the  rate  at  which  the  quantity  is 
created  within  the  domain.  For  example,  consider  a  population  of  a  certain 
animal  species  in  a  fixed  geographical  region.  The  rate  of  change  of  the  animal 
population  must  equal  the  rate  at  which  animals  migrate  into  the  region,  minus 
the  rate  at  which  they  migrate  out,  plus  the  birth  rate,  minus  the  death  rate. 
Such  a  statement  is  a  verbal  expression  of  a  balance,  or  conservation,  law.  One 
can  make  similar  kinds  of  statements  for  many  quantities — energy,  the  mass  of 
a  chemical  species,  the  number  of  automobiles  on  a  freeway,  and  so  on. 


x  a  b  x 

Figure  1.3  Tube  with  cross-sectional  area  A  shown  with  arbitrary  cross- 
section  at  x  (shaded).  The  lateral  sides  are  insulated,  and  the  physical  quantities 
vary  only  in  the  x-  direct  ion  and  in  time.  All  quantities  are  constant  over  any 
cross-section 


To  quantify  such  statements  we  require  some  notation.  Let  the  state  variable 
u  =  u(x,t)  denote  the  density  of  a  given  quantity  (mass,  energy,  animals, 
automobiles,  etc.);  density  is  usually  measured  in  amount  per  unit  volume,  or 
sometimes  amount  per  unit  length.  For  example,  energy  density  is  measured  in 
energy  units  per  volume.  We  assume  that  any  variation  in  the  state  be  restricted 
to  one  spatial  dimension.  That  is,  we  assume  a  one-dimensional  domain  (say,  a 
tube,  as  in  Figure  1.3  where  each  cross-section  is  labeled  by  the  spatial  variable 
x;  we  require  that  there  be  no  variation  of  u(x,  t )  within  the  cross-section  at 
x.  Implicit  is  the  assumption  that  the  quantity  in  the  tube  is  abundant  and 
continuous  enough  in  x  so  that  it  makes  sense  to  define  its  density  at  each 
section  of  the  tube.  The  amount  of  the  quantity  in  a  small  section  of  width  dx 
is  u(x,  t)Adx ,  where  A  is  the  cross-sectional  area  of  the  tube.  Further,  we  let 
4>  =  </>(#,  t)  denote  the  flux  of  the  quantity  at  x,  at  time  t.  The  flux  measures 
the  amount  of  the  quantity  crossing  the  section  at  x  at  time  t,  and  its  units 
are  given  in  amount  per  unit  area,  per  unit  time.  Thus,  Acj)(x ,  t)  is  the  actual 
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amount  of  the  quantity  that  is  crossing  the  section  at  x  at  time  t.  By  convention, 
flux  is  positive  if  the  flow  is  to  the  right,  and  negative  if  the  flow  is  to  the  left. 
Finally,  let  /  =  /(x,  t)  denote  the  given  rate  at  which  the  quantity  is  created,  or 
destroyed,  within  the  section  at  x  at  time  t.  The  function  /  is  called  a  source 
term  if  it  is  positive,  and  a  sink  if  it  is  negative;  it  is  measured  in  amount 
per  unit  volume  per  unit  time.  Thus,  f(x,t)Adx  represents  the  amount  of  the 
quantity  that  is  created  in  a  small  width  dx  per  unit  time. 

A  conservation  law  is  a  quantitative  relation  between  u ,  </>,  and  /.  We  can 
formulate  the  law  by  considering  a  fixed,  but  arbitrary,  section  a  <  x  <  b  of 
the  tube  (Figure  1.3)  and  requiring  that  the  rate  of  change  of  the  total  amount 
of  the  quantity  in  the  section  must  equal  the  rate  at  which  it  flows  in  at  x  =  a, 
minus  the  rate  at  which  it  flows  out  at  x  =  6,  plus  the  rate  at  which  it  is  created 
within  a  <  x  <  b.  In  mathematical  symbols, 


d_ 

dt 


u(x,  t)Adx 


A(j)(a,t)  —  A(j)(b,t)  +  f  f(x,t)Adx. 

J  a 


(1.7) 


This  equation  is  the  fundamental  conservation  law;  it  is  an  integral  expression 
of  the  basic  fact  that  there  must  be  a  balance  between  how  much  goes  in,  how 
much  goes  out,  and  how  much  is  changed.  Because  A  is  constant,  it  may  be 
canceled  from  the  formula. 

Equation  (l.T)  is  an  integral  law.  However,  if  the  functions  u  and  </>  are 
sufficiently  smooth,  then  it  may  be  reformulated  as  a  PDE,  which  is  a  local 
law.  For  example,  if  u  has  continuous  first  partial  derivatives,  then  the  time 
derivative  on  the  left  side  of  (1.7)  may  be  brought  under  the  integral  sign  to 
obtain 


d_ 

dt 


u(x ,  t)dx 


Ut(x,  t)dx. 


If  (j)  has  continuous  first  partials,  then  the  fundamental  theorem  of  calculus  can 
be  applied  to  write  the  change  in  flux  as  the  integral  of  a  derivative,  or 


4>{a,t)  -  (f>(b,t )  =  -  (px(x,t)dx 

J  a 


Therefore,  (1.7)  may  be  written 


/  (ut(x,t)  +  <px(x,t)  -  f(x,t))dx 

J  a 


=  0. 


Because  a  <  x  <  b  can  be  any  interval  whatsoever,  and  because  the  integrand 
is  continuous,  it  follows  that  the  integrand  must  vanish  identically,  or 


ut(x,  t )  +  <j)x(x,  t )  =  f(x,  t). 


(1.8) 
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Equation  (1.8)  is  a  local  version  of  (1.7),  obtained  under  the  assumption  that  u 
and  /  are  continuously  differentiable;  it  is  a  PDE  model  describing  the  relation 
between  the  density  the  quantity,  its  flux,  and  the  rate  at  which  the  quantity 
is  created.  We  call  the  PDE  (1.8)  the  fundamental  conservation  law.  The 
/-term  is  called  the  source  term,  and  the  /-term  is  called  the  flux  term.  In 
(1.8)  we  usually  drop  the  understood  notational  dependence  on  x  and  t  and 
just  write  ut  +  (j)x  =  /  for  simplicity. 

Before  studying  some  examples,  we  make  some  general  comments.  The  flux 
/  and  source  /  are  functions  of  x  and  t,  but  their  dependence  on  x  and  t  may  be 
through  dependence  upon  the  density  u  itself.  For  example,  the  source  term  / 
may  be  given  as  a  function  of  density  via  /  =  /(u),  where,  of  course,  u  =  u(x,  t). 
Similarly,  /  may  depend  on  u.  These  dependencies  lead  to  nonlinear  models. 
Next,  we  observe  that  (1.8)  is  a  single  equation,  yet  there  are  two  unknowns,  u 
and  /  (the  form  of  the  source  /  is  assumed  to  be  prescribed).  This  implies  that 
another  equation  is  required  that  relates  u  and  /.  Such  equations  are  called 
constitutive  relations  (or  equations  of  state),  and  they  arise  from  physical 
assumptions  about  the  medium  itself. 

The  Method  of  Characteristics 

In  this  section,  in  the  context  of  the  advection  of  materials  through  a 
medium,  we  introduce  the  basic  method  for  solving  first  order  PDEs,  the 
method  of  characteristics. 

Example  1.6 

(Advection)  A  model  where  the  flux  is  proportional  to  the  density  itself,  that 

is, 

/  =  cu, 

where  c  is  a  constant,  is  called  an  advection  model.  Notice  that  c  must  have 
velocity  units  (length  per  time).  In  this  case  the  conservation  law  (1.8)  becomes, 
in  the  absence  of  sources  (/  =  0), 

ut  +  cux  =  0.  (1.9) 

Equation  (1.9)  is  called  the  advection  equation.  The  reader  should  verify, 
using  the  chain  rule,  that  the  function 

u(x,  t )  =  F(x  —  ct )  (1.10) 

is  a  solution  to  (1.9)  for  any  differentiable  function  F.  Such  solutions  (1.10) 
are  called  right- traveling  waves  because  the  graph  of  F(x  —  ct)  is  the  graph 
of  F(x)  shifted  to  the  right  ct  spatial  units.  So,  as  time  t  increases,  the  wave 
profile  F{pc)  moves  to  the  right,  undistorted,  with  its  shape  unchanged,  at  speed 
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c.  Figure  1.4  shows  two  ways  of  viewing  a  right-traveling  wave.  Intuitively,  (1.9) 
describes  what  we  usually  call  advection.  For  example,  a  density  cloud  of  smoke 
carried  by  the  bulk  motion  of  the  wind  would  represent  an  advection  process. 
Other  common  descriptive  terms  for  this  kind  of  movement  are  transport  and 
convection.  □ 

Remark.  The  function  u(x,t)  =  F(z),  z  =  x  —  ct,  where  F  is  an  arbitrary 
function,  is  called  the  general  solution  of  the  advection  equation  ut  +  cux  =  0, 
c  >  0.  So  the  general  solution  is  a  right  traveling  wave.  □ 


Figure  1.4  Two  views  of  a  traveling  wave:  (a)  wave  snapshots  ( profiles )  at 
two  different  times,  and  (b)  moving  forward  in  space-time 


Remark.  If  the  flux  is  a  nonlinear  function  of  the  density,  that  is,  <fi  =  </>(u), 
then  the  conservation  law  (1.8)  (again  take  /  =  0)  takes  the  form 

Ut  +  <j)(u)x  =  Ut  +  <j)' (u)ux  =  0.  (1.11) 

If  (j)(u)  is  not  linear  in  n,  then  (1.11)  is  a  model  of  nonlinear  advection, 
and  such  models  are  more  difficult  to  analyze.  Later  in  this  section  we  examine 
simple  nonlinear  models.  Logan  (2008,  2013)  can  be  consulted  for  an  detailed 
treatment  of  nonlinear  equations.  □ 

Example  1.7 

(Advection  and  decay)  Recall  from  elementary  differential  equations  that 
decay  (e.g.,  radioactive  decay)  is  modeled  by  the  law  du/dt  =  —  An,  where  A  is 
the  decay  rate.  Thus,  a  substance  advecting  through  a  tube  at  positive  velocity 
c  (for  example,  a  radioactive  chemical  dissolved  in  water  flowing  at  speed  c)  is 
modeled  by  the  advection-decay  equation 


Ut  T  cux 


Xu. 


(1.12) 
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Here,  f  =  —Xu  is  the  source  term  (specifically,  the  decay  term)  and  <fi  =  cu  is 
the  flux  term  in  the  conservation  law  (1.8).  □ 

Example  1.8 

The  pure  initial  value  problem  for  the  advection  equation  is 

ut  +  cux  =  0,  >  0,  (1.13) 

u(x,  0)  =  uo(x),  x  G  R,  (1.14) 

where  uq(x)  is  a  given  initial  density,  or  signal.  From  (1.10)  it  follows  that  the 
solution  to  (1.13)— (1.14)  is 


u(x,  t )  =  uq(x  —  ct). 

Physically,  the  initial  density  signal  moves  to  the  right  at  speed  c.  Alternatively, 
we  think  of  the  density  signal  moving  along  the  family  of  parallel  straight  lines 
£  =  x  —  ct  =  constant  in  space-time.  These  lines,  called  characteristics,  are 
the  curves  that  carry  the  signal  For  the  pure  advection  equation,  the  solution 
moves  in  such  a  way  that  the  strength  u  of  the  density  remains  constant  along 
any  characteristic  curve.  □ 

Now  we  solve  a  general  advection  equation  of  the  form 

Ut  +  cux  +  au  =  /(x,  £),  (1.15) 

where  a  and  c  are  constants  and  /  is  a  given  function.  Because  the  advection 
equation  propagates  signals  at  speed  c,  it  is  reasonable  to  transform  this  equa¬ 
tion  to  a  moving  coordinate  system.  Thus,  let  £  and  r  be  new  independent 
variables,  called  characteristic  coordinates,  defined  by 

£  =  x  —  ct,  r  =  t. 

We  think  of  £  as  a  moving  coordinate  that  travels  (or  advects)  with  the  signal.  If 
we  denote  u(x,  t)  in  the  new  variables  by  U (£,  r)  (that  is,  U (£,  r)  =  ia(^  +  ct,  t), 
or  u(x,  t )  =  U(x  —  ct ,  £)),  then  the  chain  rule  gives 

Ut  =  U^t  +  UrTt  =  —cU^  +  UT 

and 

^ x  —  T  Uttx  — 

So  equation  (1.15)  becomes 


UT  +  aU  =  F(£,  t), 
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where  F(£,r)  =  /(£  +  cr,  r).  This  PDE  contains  derivatives  with  respect  to 
only  one  of  its  independent  variables  and  therefore  can  be  regarded  as  an  ODE 
with  the  other  independent  variable  as  a  parameter.  Thus  it  can  be  solved  by 
ODE  methods,  which  are  reviewed  in  the  Appendix.  It  has  the  form  of  a  linear 
equation,  and  so  it  can  be  solved  by  multiplying  by  the  integrating  factor  ear 
and  integrating  with  respect  to  r.  An  example  illustrates  this  procedure. 

Example  1.9 

Find  the  general  solution  of 


Ut  +  2  ux  —  u  =  t. 


Let  £  =  x  —  2t,  r  =  t.  In  these  characteristic  coordinates  the  equation  becomes 

UT  ~U  =  T. 

Multiplying  by  e~T  gives 

^L{Ue-T)=Te-T. 

Integrating, 

Ue~T  =  J  re~Tdr  =  —(1  +  r)e~T  +  g(£), 

where  g  is  an  arbitrary  function.  Transforming  back  to  xt  variables  then  gives 
the  general  solution 

u(x,  t )  =  —(1  +  t)  +  g(x  —  2 t)et.  □ 

Remark.  A  more  general  reaction-advection  PDE 

ut  +  cux  =  f(x,t,u), 

where  the  source  term  depends  on  u ,  can  in  principle  be  solved  by  making  the 
same  transformation  £  =  x  —  ct,  r  =  t  to  turn  it  into  a  simpler  equation  of 
the  form 

Ut  =  F&t,  U). 

In  these  characteristic  coordinates  the  PDE  simplifies  to  the  form  of  an  ODE 
with  only  one  derivative.  □ 

The  important  point  in  the  preceding  discussion  is  that  the  advection  oper¬ 
ator  J7  +  simplifies  to  ^  in  characteristic  coordinates;  thus,  changing 
independent  variables  is  a  strategy  for  handling  equations  having  advection 
operators.  This  solution  technique  is  called  the  method  of  characteristics. 
A  similar  characteristic  method  can  be  applied  to  solve  the  equation 


ut  +  c{x,t)ux  =  f(x,t,u). 
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In  this  case,  we  think  of  c(x,  t )  as  the  advection  speed  in  a  heterogeneous 
medium;  it  replaces  the  constant  c  in  the  previous  problem  and  now  depends 
on  the  location  in  the  medium  and  on  time.  The  characteristic  coordinates  are 
given  by  £  =  £(x,£),  r  =  £,  where  £(x,£)  =  C  is  the  general  solution  of  the 


ODE 


dx 

dt 


—  c(x,  t). 


In  these  new  coordinates  we  see  that  the  original  PDE  transforms  into  an 
equation  of  the  form 

Ut  =  F(Z,t,U), 


where  U  =  [/(£,  r).  (Verify  this.)  In  theory  this  equation  can  be  solved  for  U 
and  then  we  can  substitute  for  £  and  r  in  terms  of  x  and  t  to  obtain  u  =  u(x,  t). 


Example  1.10 

Consider  the  PDE 

ut  +  2  tux  =  0. 

Here,  c(x,  t)  =  2 1.  Setting  ^|  =  2 1  and  solving  gives  x—t2  =  C.  Thus,  £  =  x—t2. 
The  characteristic  coordinates  are 


£  =  X  -  t2,  T  =  t, 

and  we  find  by  the  chain  rule  that 

Ut  =  U^(—2t)  +  I7t,  ux  =  U^. 

Therefore  Ut  +  2 tux  =  UT  and  the  original  PDE  transforms  into  UT  =  0.  Hence 
U  =  </(£),  where  g  is  an  arbitrary  function.  The  general  solution  to  the  given 
PDE  is  thus  u(x,  t)  =  g{x  —  t2).  Observe  that  the  solution  is  constant  along  the 
set  of  characteristic  curves  (parabolas  in  space-time)  x  —  t2  =  C.  □ 

Example  1.11 

We  solve  the  advection  equation  in  the  first  quadrant  with  both  initial  and 
boundary  conditions.  Consider  the  equation 

Ut  +  2 ux  =  0,  x  >  0,  t  >  0, 

subject  to  the  initial  and  boundary  conditions 

u(x,  0)  =  tx(0,  t)  =  (1  +  t2)-1. 

We  know  the  general  solution  is  u(x,t)  =  F(x  —  2 1),  where  F  is  arbitrary. 
The  idea  is  to  let  the  PDE  carry  the  boundary  signals  into  the  region;  so 
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we  determine  the  arbitrary  function  F  separately  in  x  >  2 1  and  in  x  <  2 1. 
The  separating  characteristic  x  =  2t  is  called  the  leading  signal.  For  x  >  2t, 
ahead  of  the  leading  signal,  we  apply  the  initial  condition  at  u(x,  0)  because 
the  characteristics  in  that  region  come  from  the  x-axis: 

u(x,  0)  =  F(x)  =  e~x . 


Then 


u(x,t)  =  e  ^ x  2t\  x  >  2t. 

In  the  domain  0  <  x  <  2t  we  apply  the  boundary  condition  at  u( 0,  t)  because 
the  characteristics  in  that  region  come  from  the  t-axis: 


u(0,  t)  =  F(—2t) 


1 

lTt*' 


To  determine  the  form  of  F  let  s  =  —2 1.  Then  t  =  —s/2  and 


F(s) 


1 

1  +  s2/4’ 


Therefore,  the  solution  in  x  <  2t  is 


u(x,  t) 


1 

1  +  (x  —  2t)2/ 4  ’ 


0  <  x  <  2t. 


Notice  that  the  solution  is  continuous  along  the  leading  characteristic  x  =  2 1, 
but  the  derivatives  have  discontinuities,  giving  a  non-smooth  solution.  This 
phenomenon  is  common  for  first-order  PDEs.  Discontinuities  are  carried  along 
the  characteristics.  □ 


In  Section  1.4  there  is  an  expanded  treatment  of  advection  in  a  biological 
context. 

Nonlinear  Advection* 

In  the  last  last  few  pages  we  studied  two  simple  model  advection  equations, 
Ut  +  cux  =  0  and  ut  +  c(x,t)ux  =  0.  Both  are  first-order  and  linear.  Now 
we  study  the  same  type  of  equation  when  a  nonlinear  nonlinear  flux  (j)(u)  is 
introduced.  Then  the  conservation  law  becomes 


ut  +  <j>{u)x  =  0. 

Using  the  chain  rule  we  find  (j)(u)x  =  </>' {u)ux .  Denoting  c(u)  =  </>' (u)  gives, 
after  appending  an  initial  condition,  the  IVP 

ut  +  c(u)ux  =  0,  x  G  R,  t  >  0, 

u(x,  0)  =  </>(#),  xGl. 


(1.16) 

(1.17) 
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We  think  of  u  as  a  density  and  c(u)  as  the  speed  that  waves  propagate.  In  many 
physical  problems  the  speed  that  waves  propagate  increases  with  the  density, 
so  we  assume  for  now  that  d (u)  >  0. 

Consistent  with  the  solution  method  for  linear  advection  equations,  we 
define  the  characteristic  curves  as  integral  curves  of  the  differential  equation 

dx  ,  .  , 

—  =  c(W).  (1.18) 

Then  along  a  particular  characteristic  curve  x  =  x(t)  we  have 


du 

dt 


(x(t),t)  =  ux(x(t),t)c(u(x(t))  +  Ut(x(t),t)  =  0. 


Therefore,  like  linear  equations,  u  is  constant  along  the  characteristic  curves. 
The  characteristics  curves  are  straight  lines  because 

d2x  d 
dt 2  dt 


( dx \  d  ,  ,  /  \ \  ,,  sdu 

(w)  =  ;r<“M,>>  =  c>>*=°- 


In  the  nonlinear  case,  however,  the  speed  of  the  characteristic  curves  as  defined 
by  (1.18)  depends  on  the  value  u  of  the  solution  at  a  given  point.  To  find  the 
equation  of  the  characteristic  C  through  (x,  t )  we  note  that  its  speed  is 

|=cW?,0))  =  cWO) 

(see  Figure  1.5).  In  the  xt  coordinate  system,  the  speed  of  a  signal  is  the 
reciprocal  of  its  slope.  This  results  from  applying  (1.18)  at  (£,  0).  Thus,  after 
integrating,  the  characteristic  curve  is  given  by 


x  =  c(<j>(^))t  +  e  (i.i9) 

Equation  (1.19)  defines  £  =  £(#,£)  implicitly  as  a  function  of  x  and  t,  and  the 
solution  u(x,t)  of  the  initial  value  problem  (1.16)  and  (1.17)  is  given  by 

u(x,t)  =  (/>(£)  (1.20) 

where  £  is  defined  by  (1.19). 

In  summary,  for  the  nonlinear  advection  equation  (1.16): 

(a)  Every  characteristic  curve  is  a  straight  line. 

(b)  The  solution  u  is  constant  on  each  such  characteristic. 

(c)  The  speed  of  each  characteristic,  is  equal  to  the  value  of  c(u)  on  that 
characteristic. 

(d)  The  speed  c(u)  is  the  speed  that  signals,  or  waves,  are  propagated  in  the 
system. 
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Figure  1.5  A  diagram  showing  characteristics,  or  signals,  moving  at  different 
speeds;  each  characteristic  carries  a  constant  value  of  u  determined  by  its  initial 
value  at  t  =  0,  at  the  point  (£,  0).  The  equation  of  the  characteristic  shown, 
from  (£,  0)  to  (x,£),  is  given  by  (1.19).  Its  slope  in  the  xt  coordinate  system  is 
the  inverse  of  its  speed 


Figure  1.6  Initial  wave  profile  in  Example  1.12 


Example  1.12 

Consider  the  initial  value  problem 

ut  +  uux  =  0,  xGl,  t  >  0, 

(  2,  x  <  0, 

u(x,  0)  =  4>{x)  =  <  2  —  x,  0  <  x  <  1, 

[  1,  x  >  1. 

The  initial  curve  is  sketched  in  Figure  1.6.  Since  c(u)  =  u  the  characteristics 
are  straight  lines  emanating  from  (£,  0)  with  speed  c(</>(£))  =  </>(£).  These  are 
plotted  in  Figure  1.7.  For  x  <  0  the  lines  have  speed  2;  for  x  >  1  the  lines  have 
speed  1;  for  0  <  x  <  1  the  lines  have  speed  2  —  x  and  these  all  intersect  at  (2, 
1).  Immediately  one  observes  that  a  solution  cannot  exist  for  t  >  1,  because 
the  characteristics  cross  at  that  time  and  they  carry  different  constant  values  of 
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Figure  1.7  Characteristic  diagram  showing  colliding  characteristics 


u.  Figure  1.8  shows  several  wave  profiles  that  indicate  steepening  of  the  signal 
as  it  propogates.  At  t  =  1  the  wave  breaks ,  which  is  the  first  instant  when  the 
solution  would  become  multiple  valued.  To  find  the  solution  for  t  <  1  we  note 
that  u(x,t)  =  2  for  x  <  2t  and  u(x,t)  =  1  for  x  >  t  +  1.  For  2t  <  x  <  t  +  1 
equation  (1.19)  becomes 

x  =  (2  — 


which  gives 

t  x  —  2t 
^  ~  1  -  t  ’ 

Equation  (1.20)  then  yields 


u(x,  t) 


2  —  x 

1  -V 


2t  <  x  <  t  +  1,  t  <  1. 


This  explicit  form  of  the  solution  also  indicates  the  difficulty  at  the  breaking 
time  t  =  1.  □ 
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In  general  the  initial  value  problem  (1.16) — (1.17)  may  have  a  solution  only 
up  to  a  finite  time  £&,  which  is  called  the  breaking  time.  Let  us  assume  in 
addition  to  d  (u)  >  0  that  the  initial  wave  profile  satisfies  the  conditions 

4>(x)  >  0,  <  0. 


At  the  time  when  breaking  occurs  the  gradient  ux  will  become  infinite.  To 
compute  ux  we  differentiate  (1.19)  implicitly  with  respect  to  x  to  obtain 


sic 


1 


i  +  c,o(£))0,(or 


Then  from  (1.20) 


u 


no 


i  +  c' 0(6)^' (£)C 

The  gradient  catastrophe  will  occur  at  the  minimum  value  of  £,  which  makes 


the  denominator  zero.  Hence 


1 


4  'Tnontioy 


tb  >  0. 


In  the  last  example,  c(u)  =  u  and  </>(£)  =  2  — Hence  <//(£)c'(0(£))  =  (— 1)(1)  = 
—  1  and  tb  =  1  is  the  time  when  breaking  occurs. 

In  summary  we  showed  that  the  nonlinear  partial  differential  equation 


Ut  +  c(u)ux  =  0,  d(u)  >  0 


propagates  the  initial  wave  profile  at  a  speed  c(u),  which  depends  on  the  value 
of  the  solution  u  at  a  given  point.  Since  d (u)  >  0,  large  values  of  u  are  propa¬ 
gated  faster  than  small  values  and  distortion  of  the  wave  profile  occurs.  This  is 
consistent  with  our  earlier  remarks.  Wave  distortion  can  occur  and  shock  waves , 
or  discontinuities,  develop  in  materials  because  of  the  property  of  the  medium 
to  transmit  signals  more  rapidly  at  higher  levels  of  stress  or  pressure.  Mathe¬ 
matically,  distortion  and  the  development  of  shocks  or  discontinuous  solutions 
are  distinctively  nonlinear  phenomena  caused  by  the  advection  term  c(u)ux. 


Example  1.13 

(Implicit  solution)  When  the  advection  speed  is  constant,  we  showed  that 


Uf  T  cux  —  0 

has  general  solution  given  explicitly  by  u  =  F(x  —  ct),  where  F  is  an  arbitrary 
function.  A  similar  type  implicit  solution  can  occur  for  the  nonlinear  equation 


ut  +  c(u)ux  =  0. 


24 


1.  The  Physical  Origins  of  Partial  Differential  Equations 


In  an  exercise,  the  reader  is  asked  to  show,  using  the  chain  rule,  that  the 
expression 

u  =  F(x  —  c(u)t ), 

defines  the  solution  u  =  u(x,  t )  implicitly,  when  it  exists.  The  arbitrary  function 
F  is  determined,  for  example,  by  an  initial  condition.  □ 

Example  1.14 

Consider  the  PDE 

o 

Ut  +  u  ux  =  0. 

The  general  solution  is  given  implicitly  by  u  —  F(x  —  u2t ),  which  is  easily 
verified.  If  u(x,  0)  =  x,  then  F(x)  =  x  and  u  =  x  —  u2t.  Solving  for  u  gives 

u  =  ^  (-1  ±  Vl  +  4 tx) 

=  —  (—1  +  Vl  +  4 tx)  , 

where  we  have  taken  the  positive  square  root  to  meet  the  initial  condition.  The 
solution  is  valid  for  t  <  —  l/4x.  (See  the  Exercises.)  □ 

Example  1.15 

(Traffic  flow)  Everyone  who  drives  has  experienced  traffic  issues,  such  as  jams, 
poorly  timed  traffic  lights,  high  road  density,  etc.  In  this  abbreviated  example 
we  suggest  how  some  of  these  issues  can  be  understood  with  a  simple  model. 
Traffic  moving  in  a  single  direction  x  with  car  density  p(x,  £),  given  in  cars  per 
kilometer,  can  be  modeled  by  a  conservation  law 

Pt  +  (pv)x  =  0, 

where  v  =  v(x,t)  is  the  local  car  speed  (kilometers  per  hour),  and  pv  is  the 
the  flux,  in  cars  per  hour.  Importantly,  we  are  making  a  continuum  assumption 
about  cars,  surely  a  questionable  one.  As  always,  we  need  a  constitutive  assump¬ 
tion  to  close  the  system.  By  experience,  the  speed  of  traffic  surely  depends  on 
traffic  density,  or,  v  =  F(p).  The  simplest  model  is  to  assume  that  the  flux 
pv  is  zero  when  p  is  zero,  and  is  jammed  (no  flux)  when  the  density  is  some 
maximum  value  pj.  Therefore,  we  take 
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where  vm  is  the  maximum  velocity  of  cars.  Note  that  this  flux  curve  is  a 
parabola,  concave  down.  The  conservation  law  can  then  be  written  after  rescal¬ 
ing  as 


ut  +  vm[u(  1 


u  = 


P 

PM 


This  equation  can  be  expanded  to 


ut  +  um(1  -  2 u)ux  =  0, 

so  c(u)  =  vm{  1  ~  2 u)  is  the  speed  that  traffic  waves  move  in  the  system.  It 
is  different  from  the  speed  of  cars!  Because  c'(u)  <  o,  signals  are  propagated 
backward  into  the  traffic  flow.  This  is  reasonable.  When  you  drive  and  a  traffic 
density  change  occurs  ahead  of  you  (such  a  slowing  down),  that  signal  moves 
backward  into  the  line  of  cars  and  eventually  you  are  forced  to  slow  down.  □ 


EXERCISES 


1. 


How  does  the  basic  conservation  law  (1.8)  change  if  the  tube  has  variable 
cross-sectional  area  A  =  A(x)  rather  than  a  constant  cross-sectional  area? 
(Assume  that  the  variation  in  area  is  small  over  the  interval.)  Derive  the 
formula 


Ut  T  (fix 


2.  Solve  the  initial  value  problem 

Ut  +  cux  =  0,  x  G  R,  t  >  0;  u(x,  0)  =  e~x  ,  x  E  R. 

Pick  c  =  2  and  sketch  the  solution  surface  and  several  time  snapshots.  Do 
you  see  a  traveling  wave?  Sketch  the  characteristic  curves  in  the  aT-plane. 

3.  Find  the  general  solution  of  the  advection-decay  equation  (1.12)  by  trans¬ 
forming  to  characteristic  coordinates  £  =  x  —  ct,  r  =  t. 

4.  Show  that  the  decay  term  in  the  advection-decay  equation  (1.12)  can  be 
removed  by  making  a  change  of  the  dependent  variable  to  w  =  uext. 

5.  Solve  the  pure  initial  value  problems  in  the  region  xGl,  t  >  0. 

ut  +  xtux  =  0,  u(x,  0)  =  f(x) 

and 

ut  +  xux  =  e*,  u(x,  0)  =  f(x). 


6.  Solve  the  following  initial  value  problems. 
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a)  ut  +  xux  =  —tu,  xGl,  t  >  0;  u{x,  0)  =  /(#),  xGi 

b)  tut  +  =  — 2iq  xGl,  t  >  1;  i/(x,  1)  =  /(#),  xGl. 

c)  ut  +  ux  =  — tu ,  xGR,  t  >  0;  u(x,  0)  =  /(#),  xGl. 

d)  2 ut  +  ux  =  —2u,  x,t  G  R,  t  >  0;  u(x,t)  =  f(x,t)  on  the  straight  line 
x  =  t,  where  /  is  a  given  function  . 

e)  tuut  +  x  G  R,  t  >  1;  u(x,  1)  =  f(x),  xGi 

7.  Solve  the  initial  boundary  value  problem 

Ut  +  cux  =  —At/,  x,  t  >  0, 
t/(x,0)  =  0,  x  >  0,  t/(0,t)  =  g(t ),  t  >  0. 

In  this  problem  treat  the  domains  x  >  ct  and  x  <  ct  differently  as  in 
Example  1.14;  the  boundary  condition  affects  the  solution  region  x  <  ct , 
and  the  initial  condition  affects  it  in  the  region  x  >  ct. 

8.  Solve  the  pure  initial  value  problem 

Ut  +  ux  —  3u  =  t,  x  G  R,  t  >  0, 
u(x,  0)  =  x2,  xGl. 

9.  To  study  the  absorption  of  nutrients  in  an  insect  gut  we  model  its  diges¬ 
tive  tract  by  a  tube  of  length  l  and  cross-sectional  area  A.  Nutrients  of 
concentration  n  =  n(x,  t)  flow  through  the  tract  at  speed  c,  and  they  are 
adsorbed  locally  at  a  rate  proportional  to  y/n.  What  is  the  PDE  model? 
If  the  tract  is  empty  at  t  =  0  and  then  nutrients  are  introduced  at  the 
constant  concentration  no  at  the  mouth  {x  =  0)  for  t  >  0,  formulate  an 
initial  boundary  value  problem  for  n  =  n(x,t).  Solve  this  PDE  model  and 
sketch  a  graph  of  the  nutrient  concentration  exiting  the  tract  at  x  =  l  for 
t  >  0.  Physically,  explain  why  is  the  solution  n(x,  t)  =  0  for  x  >  ct. 

10.  Explain  why  the  function  u{x,t)  =  G(x  +  ct),  c  >  0,  is  called  a  left- 
traveling  wave.  Explain  how  you  would  solve  the  advection  equation 

ut  —  cux  =  F{x,  t,  u)l 

11.  The  density  of  cars  on  a  busy  one-lane  freeway  with  no  exits  and  entrances 
is  u  —  u{x ,  t)  cars  per  mile.  If  </>  =  </>(ay  t)  is  the  flux  of  cars,  measured  in 
cars  per  hour,  derive  a  conservation  law  relating  the  density  and  flux.  Why 
would  ( j)  =  au{/3  —  u)  (a,  /?  >  0)  be  a  reasonable  assumption?  Write  down 
the  resulting  nonlinear  PDE  for  u. 
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12.  Find  a  formula  that  implicitly  defines  the  solution  u  =  u(x,  t)  of  the  initial 
value  problem  for  the  reaction-advection  equation 

au 

ut  +  cux  =  — ,  xGl,  t>  0, 

p  +  u 

u(x,  0)  =  /(#),  xGl. 

Here,  i?,  a,  and  [3  are  positive  constants.  Show  from  the  implicit  formula 
that  you  can  always  solve  for  u  in  terms  of  x  and  t. 

13.  Write  a  formula  for  the  general  solution  of  the  equation 

Ut  +  cux  =  f(x)u. 

Hint:  Your  answer  should  involve  an  integral  with  variable  limits  of  inte¬ 
gration. 

14.  Consider  the  Cauchy  problem 

ut  =  xuux ,  x  G  R,  t  >  0, 
u(x,  0)  =  x,  xGl. 

Find  the  characteristics,  and  find  a  formula  that  determines  the  solution 
u  =  u(x,  t)  implicitly  as  a  function  of  x  and  t.  Does  a  smooth  solution  exist 
for  all  t  >  0? 


15.  Consider  the  initial  value  problem 


ut  +  uux  =  0,  x  G  R,  t  >  0, 


u(x,  0)  = 


1  —  x2, 

0, 


x 

X 


<  1, 

>  1. 


Sketch  the  characteristic  diagram.  At  what  time  4  does  the  wave  break? 
Find  a  formula  for  the  solution. 

16.  Consider  the  initial  value  problem 

ut  +  uux  =  0,  x  G  R,  t  >  0, 
u(x,  0)  =  exp(— x2),  x  G  R. 


Sketch  the  characteristic  diagram  and  find  the  point  (#&,£&)  in  space-time 
where  the  wave  breaks. 


17.  Consider  the  Cauchy  problem 

ut  +  c(u)ux  =  0,  x  G  R,  £  >  0, 

u(x,  0)  =  /(#),  xGl. 

Show  that  if  the  functions  c(u)  and  f(pc)  are  both  nonincreasing  or  both 
nondecreasing,  then  no  shocks  develop  for  t  >  0. 
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18.  Consider  the  problem 


Ut  +  u2ux  =  0,  x  G  R,  t  >  0, 
u(x,  0)  =  x,  xGl. 


Derive  the  solution 


u(x,  t) 


x ,  t  =  0, 

I  XkEgL ;  i  ^  o,  1  +  4te  >  0. 


When  do  shocks  develop?  Verify  that  limt^0+  ^0*5  t)  =  x. 
19.  Consider  the  signaling  problem 

ut  +  c(u)ux  =  0,  t  >  0,  x  >  0, 

0)  =  uo,  x  >  0, 

u(0,  t)  =  g(t),  t  >  0, 


where  c  and  g  are  given  functions  and  uq  is  a  positive  constant.  If  c'(u )  >  0, 
under  what  conditions  on  the  signal  g  will  no  shocks  form?  Determine  the 
solution  in  this  case  in  the  domain  x  >  0,  t  >  0. 

20.  In  the  traffic  flow  model  in  Example  1.15,  explain  what  occurs  if  the  initial 
car  density  has  each  of  the  following  shapes:  (a)  a  density  bump  in  the 
traffic  having  the  shape  of  a  bell-shaped  curve;  (b)  a  density  dip  in  the 
traffic  having  the  shape  of  an  inverted  bell-shaped  curve;  (c)  a  density  that 
is  jammed  for  x  <  0,  with  no  cars  ahead  for  x  >  0  (a  stop  light);  (d)  a 
density  that  is  shaped  like  a  curve  tt/2  +  arctanx  where  the  traffic  ahead 
has  increasing  density. 

In  each  case,  sketch  a  qualitative  characteristic  diagram  and  sketch  several 
density  profiles.  On  the  characteristic  diagram  sketch  a  sample  car  path. 

21.  The  height  h  =  h(x,t)  of  a  flood  wave  can  be  modeled  by 


ht  +  (vh)x  =  0, 

where  v,  the  average  stream  velocity,  is  v  =  a\fh ,  a  >  0  (Chezy’s  law). 
Show  that  flood  waves  propagate  1.5  times  faster  than  the  average  stream 
velocity. 

22.  Explain  why  the  I  VP 


Ut  +  ux  =  x,  x  G  R,  u(x,  x)  =  1,  xGl, 


has  no  solution. 

23.  Solve  the  PDE  ut  +  ux 


0  with  ^(cos  0,  sin  0)  =  0,  0  <  0  <  2tt. 
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1.3  Diffusion 

The  basic  conservation  law  (1.8)  with  no  sources  is 

ut  +  4>x  =  0*  (1-21) 

To  reiterate,  u  =  u(x,  t )  represents  the  density  of  a  physical  quantity,  and 
(j)  =  <f(x,t)  represents  its  flux.  Equation  (1.21)  describes  locally  how  changes  in 
density  are  related  to  changes  in  flux.  In  the  last  section  we  modeled  advection 
by  assuming  that  the  flux  was  proportional  to  the  density  (flux  equals  velocity 
times  density,  or  0  =  cu).  Now  we  want  to  model  a  simple  diffusion  process. 

To  fix  the  notion  let  u  denote  the  concentration  of  some  chemical  species, 
say  a  gas,  in  a  tube.  We  expect  that  the  random  motion  and  collisions  of  the 
molecules  will  cause  concentrations  of  the  gas  to  spread  out;  the  gas  will  move 
from  higher  concentrations  to  lower  concentrations.  The  same  could  be  said  for 
insects  in  a  tube,  people  congregated  in  a  hallway,  or  heat  energy  in  a  metal 
bar. 

To  model  this  type  of  motion,  which  is  based  on  random  collisions,  we  make 
two  observations: 

(i)  the  movement  is  from  higher  concentrations  to  lower  concentrations. 

(ii)  the  steeper  the  concentration  gradient ,  or  the  derivative,  the  greater  the 
flux. 

Therefore,  the  flux  should  depend  on  the  ^-derivative  of  the  density  (which 
measures  the  steepness  of  the  density  curve).  Assuming  a  simple  linear  rela¬ 
tionship,  we  take 

<p  =  — Dux ,  (Fick’s  law)  (1.22) 

where  D  >  0  is  a  constant  of  proportionality.  The  minus  sign  guarantees  that 
if  ux  <  0,  then  <f  will  be  positive  and  the  flow  will  be,  by  our  convention,  to 
the  right;  if  ux  >  0,  then  <p  will  be  negative  and  the  flow  will  be  to  the  left. 
We  say  that  the  flow  is  down  the  gradient.  Equation  (1.22)  is  called  Fick’s 
law,  and  the  constant  D  is  called  the  diffusion  constant;  D  is  measured  in 
length-squared  per  unit  time.  (See  Figure  1.9.) 

When  the  constitutive  equation  (1.22)  is  substituted  into  the  conservation 
law  (E21),  we  obtain  a  simple  model  equation 

DuXx  —  0? 

which  is  called  the  diffusion  equation.  This  PDE  model  is  one  of  the  funda¬ 
mental  equations  in  applied  mathematics,  physics,  engineering,  and  biology. 

Geometrically,  and  physically,  the  diffusion  equation  states  that  if  uxx  >  0 
at  a  point,  or  the  concentration  profile  is  concave  up,  then  ut  >  0,  or  the 
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Figure  1.9  A  time  snapshot  of  a  concentration  profile  u(x,  t ).  Diffusive  motion 
is  from  higher  concentrations  to  lower  concentrations,  or  down  the  gradient.  The 
magnitude  of  the  flux  is  proportional  to  the  slope 


concentration  increases  at  that  point.  Alternately,  if  the  profile  is  concave  down 
at  a  point,  then  uxx  <  0  and  the  concentration  decreases,  ut  <  0.  Thus,  there 
is  a  flattening  effect.  This  is  illustrated  in  Figure  1.10. 


Figure  1.10  A  time  snapshot  of  a  concentration  profile  u(x,t).  Where  the 
profile  is  concave  down,  the  concentration  decreases;  where  it  is  concave  up,  it 
increases 


Example  1.16 

(Heat  Conduction)  Consider  heat  flow  in  a  one-dimensional  bar  having  a 
constant  density  p  and  constant  specific  heat  C .  Both  of  these  constants  are 
physical  parameters  that  are  tabulated  in  engineering  and  physics  handbooks. 
The  specific  heat  is  the  amount  of  energy  required  to  raise  a  unit  mass  of 
material  one  degree,  and  it  is  given  in  units  of  energy  per  mass  per  degree.  We 
may  apply  the  basic  conservation  law  (1.21)  to  the  bar,  with  u  being  the  energy 
density  given  by  u(x,t)  =  pC0(x,t),  where  0  =  0{x,t)  is  the  temperature  at 
(x,  t )  (the  stipulation  that  energy  is  proportional  to  temperature  is,  in  itself, 
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an  assumption  about  the  medium).  Therefore, 

pCOt  +  <l>x=  0  (1.23) 

is  an  expression  of  energy  balance  in  the  bar  when  no  sources  are  present.  The 
energy  flux  <p  is  assumed  to  be  given  by  a  Fick’s  law- type  expression 

<f>  =  —K0Xl  (Fourier’s  law)  (1-24) 


where  K  is  thermal  conductivity,  another  physical  constant.  In  the  context 
of  heat  flow,  (1.24)  is  called  the  Fourier  heat  law.  This  law,  a  constitutive 
relation  based  on  empirical  evidence,  is  a  statement  of  the  fact  that  heat  flows 
from  hotter  regions  to  colder  regions;  stated  differently,  heat  flows  down  the 
temperature  gradient.  This  statement  is  a  manifestation  of  the  second  law  of 
thermodynamics.  Now,  we  may  substitute  (1.24)  into  (1.23)  to  obtain  a  single 
equation  for  the  temperature  6(x,t),  namely, 


@t  kOxx  —  0? 


(1.25) 


Equation  (1.25)  is  the  heat  equation;  it  is  the  diffusion  equation  in  the  context 
of  heat  flow.  The  constant  k  is  called  the  diffusivity  and  is  a  property  of  the 
medium;  values  of  k  for  different  media  (metals,  plastics,  etc.)  can  be  found 
in  physical  handbooks.  Note  that  k  has  the  same  dimensions  (length-squared 
per  time)  as  the  diffusion  constant  D.  In  the  sequel  we  shall  often  use  u  in 
place  of  6  for  the  temperature  function.  The  physical  interpretation  is  shown 
in  Figure  1.10,  where  u  is  the  temperature.  □ 


Example  1.17 


(Time  scale)  Associated  with  a  simple  heat  conduction  problem  is  its  charac¬ 
teristic  time,  or  time  scale,  an  easily  computed  number  that  roughly  indicates 
how  long  it  takes  for  heat  to  flow  through  a  region.  We  could  call  this  number 
a  ‘quick  engineering  estimate.’  This  quantity  is  found  by  noting  the  diffusivity 
k  has  units  length-squared  divided  by  time.  Thus,  if  L  is  the  length  of  a  region 
and  T  is  the  time  for  discernible  temperature  changes  to  occur ,  then 


Thus  T  =  L2 /k  is  called  the  time  scale  for  the  problem. 


□ 
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In  some  cases  the  thermal  conductivity  K  in  (1.24)  may  not  be  constant,  but 
rather  may  depend  on  x  if  the  bar  is  nonhomogeneous;  over  large  temperature 
ranges  the  conductivity  also  depends  on  the  temperature  0.  If,  for  example, 
K  =  K(6),  then  we  obtain  a  nonlinear  heat  model 

pCOt  -  {K{9)9X)X  =  0. 

It  is  possible,  of  course,  that  the  density  and  specific  heat  could  depend  on 
the  location  x  or  the  temperature  6.  If  C,  p,  or  K  depend  on  time,  then  the 
material  is  said  to  have  memory. 

Example  1.18 

(Advection— Diffusion)  If  both  diffusion  and  advection  are  present  (think  of 
smoke  diffusing  from  a  smokestack  on  a  windy  day),  then  the  flux  is  given  by 

(j)  =  cu  —  Dux , 

and  the  conservation  law  (1.21)  becomes 

Ut  +  cux  —  Duxx  =  0, 

which  is  the  advection— diffusion  equation.  This  equation  would  govern  the 
density  of  a  chemical,  say,  that  is  being  advected  by  the  bulk  motion  of  a 
fluid  moving  at  velocity  c  in  which  it  is  dissolved,  while  at  the  same  time  it  is 
diffusing  according  to  Fick’s  law.  If  the  chemical  also  decays  at  rate  A,  then  we 
include  a  source  term,  and  the  model  becomes 

Ut  +  cux  —  Duxx  =  —Xu, 

which  is  a  advection— diffusion— decay  equation.  □ 

Example  1.19 

(Contaminant  flow  in  aquifers)  Water  resources  is  one  of  the  most  impor¬ 
tant  issues  that  societies  face.  One  example  is  is  how  contaminants  from  chem¬ 
ical  spills,  etc.,  are  carried  through  subsurface  structures.  Soil  is  a  porous 
medium  consisting  of  a  fixed  soil  matrix  interspersed  with  open  pores  through 
which  groundwater  flows.  The  fraction  of  water  volume  to  total  volume  is  c o, 
which  is  called  the  porosity.  Simply,  think  of  the  medium  as  a  long  cylinder  of 
cross-sectional  area  A.  A  solute,  or  contaminant,  carried  by  the  water,  has  con¬ 
centration  C  =  C(x,t),  which  is  the  mass  of  the  solute  divided  by  the  volume 
of  the  water.  The  conservation  law  for  the  solute  states  that  the  rate  of  change 
of  the  amount  of  solute  in  the  interval  equals  its  flux  plus  the  rate  St  that  the 
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solute  is  adsorbed,  or  desorbed,  by  the  soil  particles;  the  quantity  S  =  S(x,t) 
is  the  amount  of  the  solute  that  is  sorbed  onto  the  soil  particles: 


d_ 

dt 


f  ojC  Adx  =  A(j)(a,t)  —  A(j)(b,t) 

J  a 


io  St  Adx. 


There  are  three  sources  of  flux:  advection,  molecular  diffusion,  and  kinematic 
dispersion.  As  before,  advection  is  bulk  motion  due  to  the  solute  being  carried 
by  the  water  flow.  Hydrogeologists  measure  the  discharge  q  (volume  of  water 
per  time)  and  the  filtration,  or  Darcy ,  velocity  V  =  q/A,  the  discharge  per 
area.  The  advective  flux  is 


0(a)  =  qC  =  AVC. 


Molecular  diffusion  is  just  the  random  motion  of  particles  in  still  water  and  is 
measured  by  Fick’s  law,  and  kinematic  dispersion ,  or  spreading,  is  caused  by 
the  ‘tortuous’  motion  through  the  winding  porous  pathways  of  the  medium. 
These  are  given  by,  respectively, 

0(diff)  =  - AujDCx ,  0(disp)  =  -AljD^Cx, 


where  D  >  0  is  the  diffusion  constant  and  DA)  is  the  dispersion  coefficient. 
The  latter  depends  on  the  filtration  velocity;  for  example,  DA)  =  a  \  V  |.  The 
total  flux  Acj)  is  the  sum  of  these  three  fluxes.  Substituting  these  definitions 
into  the  conservation  law  and  using  the  arbitrariness  of  the  interval,  we  get  in 
the  usual  way 

Ct  =  (D  +  D^)CXX  -  vCx  -  St,  (1.26) 


where  v  =  V/lv  is  the  average  velocity.  It  is  generally  true  that  dispersion 
is  much  stronger  than  molecular  diffusion,  or  D^d)  >>  D.  Equation  (1.26) 
is  one  of  the  most  important  equations  in  hydrogeology,  and  it  is  called  the 
reaction— advection— dispersion  equation.  There  are  two  unknowns  in  (1.26), 
the  contaminant  concentration  C  in  the  water  and  the  amount  S  sorbed  onto 
the  soil.  We  should  expect  a  constitutive  relationship  between  the  two.  The 
simplest  assumption  is  that  there  an  equilibrium  of  the  form  S  =  /(C),  for 
example, 


S  =  KC  or 


aC 

1  +  pc' 


These  relationships  clearly  lead  to  a  single  PDE  for  C .  One  the  other  hand, 
chemical  kinetic  relations  of  the  form  St  =  F(C,  S)  model  nonequilibrium  pro¬ 
cesses.  In  this  case  there  are  coupled  PDEs  for  C  and  S.  The  reader  can  consult 
Fetter  (1993)  or  the  monograph  by  Logan  (2001).  □ 
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Diffusion-like  equations  are  generally  accompanied  by  an  initial  condition 
that  specifies  the  density  at  time  t  —  0  at  all  points  of  the  spatial  domain,  as  well 
as  boundary  conditions  that  specify  conditions  on  the  density  at  the  boundaries 
of  the  domain  for  all  time.  This  is  especially  true  for  heat  conduction  as  well, 
the  topic  on  which  we  focus.  To  fix  the  idea,  consider  a  metal  cylinder  of  finite 
length  0  <  x  <  l.  Then  an  initial  condition  has  the  form 

u(x,  0)  =  uq(x),  0  <  x  <  Z, 

where  no  is  a  prescribed  initial  temperature  distribution  over  the  length  of  the 
bar.  There  are  three  types  of  boundary  conditions  that  usually  arise  in  physical 
problems.  If  the  temperature  is  specified  at  the  boundary  x  =  0,  for  example, 
then  we  have 

u(0,  t )  =  g(t ),  t  >  0, 

where  g  is  a  given  function;  this  condition  is  called  a  Dirichlet  condition.  We 
could  also  specify  the  energy  flux  at  a  boundary,  for  example, 

-Kux(0,t)  =  h(t),  t  >  0, 

which  is  called  a  Neumann  condition.  If  h(t)  =  0,  then  we  say  that  the 
boundary  is  insulated;  heat  cannot  flow  in  or  out.  If  g  or  h  is  zero,  we  also 
say  that  the  corresponding  boundary  condition  is  homogeneous. 

A  third  type  of  boundary  condition  has  the  form  (say,  at  x  =  0) 

-Kux(0,t)  =  -rj(u(Q,t)  -  t  >  0. 

For  heat  flow  in  a  bar,  for  example,  this  law  expresses  Newton’s  law  of  cooling, 
which  states  that  the  heat  flux  is  proportional  to  the  temperature  difference 
between  the  end  of  the  bar  and  a  given  temperature  2p(t)  of  the  environment; 
r]  is  the  constant  of  proportionality  representing  a  heat-loss  factor.  These  types 
of  conditions  are  called  radiation,  or  Robin,  conditions. 

The  requirement  of  a  well-posed  problem  is  that  the  PDE  be  accompanied 
by  appropriate  initial  and  boundary  conditions  so  that  the  resulting  mathemat¬ 
ical  problem  has  a  unique  solution  as  well  as  a  correct  physical  meaning. 

Steady-State  Solutions 

Many  PDE  models  have  the  property  that  after  a  long  time  the  transients 
caused  by  initial  conditions  decay  away  and  the  solution  approaches  a  steady- 
state  u  =  u(x),  dependent  only  upon  x.  In  this  case  the  time-derivatives  in 
the  equation  vanish  and  u(x)  satisfies  an  ODE  with  the  appropriate  boundary 
conditions.  A  simple  example  is  given  by  the  heat  equation  ut  =  kuxx.  The 
steady-state  solution  u  =  u(x)  satisfies  ku"(x)  =  0,  which,  when  solved,  gives 
a  linear  temperature  profile  u(x)  =  ax  +  b.  The  constants  a  and  b  can  be 
determined  by  the  boundary  conditions. 
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Example  1.20 

Consider  an  initial  boundary  value  problem  for  the  diffusion-decay  equation: 

ut  =  Duxx  —  ru ,  0  <  x  <  L,  t  >  0, 

u(Q,t)  =  0,  —Dux(L,t)  =  —  1,  £  >  0, 

0)  =  g(x),  0  <  x  <  L. 

This  model  could,  for  example,  represent  a  diffusing  pollutant  in  a  canal  with 
per  capita  decay  rate  r.  At  the  left  boundary  its  density  is  maintained  at  zero, 
while  at  the  right  boundary  the  pollutant  is  introduced  into  the  canal  at  rate 
1  per  unit  area  per  unit  time.  The  steady-state  model  is  the  pure  boundary 
value  problem 


Du"  —  ru  =  0,  0  <  x  <  L, 

u(0)  =  0,  — Du!  [L )  =  — 1 


for  u  =  u(pc).  The  initial  condition  is  ignored  because  transients  are  assumed 
to  have  decayed  away  and  longer  affect  the  solution.  To  solve  the  differential 
equation  we  apply  elementary  methods  (see  the  Appendix)  to  obtain 

u(x)  =  ci  sinh (y/r/D  x)  +  c 2  cosh (yjr / D  x), 


where  c\  and  C2  are  arbitrary  constants.  The  boundary  condition  u(0)  =  0  forces 
C2  =  0,  giving  u{x)  =  c\  sinh (yJr/Dx).  The  right  boundary  condition  gives 
—D^/r/D  c\  cosh {^/r / D  L)  =  — 1,  or  ci  =  1/y/rD  cosh( \frjD  L).  Therefore 
the  steady-state  solution,  representing  the  long  time  concentration  in  the  canal, 

is 


sinh(  y/r/D  x) 
\JrD  cos  H^/rjDL) 


□ 


Pure  boundary  value  problems  for  ODEs  are  unlike  initial  value  problems 
in  that  they  do  not  always  have  a  solution,  or  they  may  have  many  solutions. 
Thus,  there  is  no  guarantee  that  the  system  has  a  steady-state  solution.  If  there 
is  a  steady-state,  it  may  be  unstable  and  the  system  will  not  sustain  it.  These 
issues  warrant  further  analysis,  and  they  are  discussed  in  Chapters  4  and  5. 


EXERCISES 

1.  Heat  flows  longitudinally  through  a  laterally  insulated  metal  bar  of  length 
10  centimeters,  and  the  temperature  u  =  u(x,  t )  satisfies  the  diffusion  equa¬ 
tion  ut  =  kuxx ,  where  k  =  0.02  square  centimeters  per  second.  Suppose 
the  temperatures  at  some  fixed  time  T  at  x  =  4,  6,  8  cm  are  58,  64,  and  72 


36 


1.  The  Physical  Origins  of  Partial  Differential  Equations 


degrees,  respectively.  Estimate  uxx(6,T)  using  a  difference  approximation. 
Will  the  temperature  at  x  =  6  increase  or  decrease  in  the  next  instant  of 
time?  Estimate  the  temperature  at  x  =  6  at  T  +  0.5  seconds.  Hint:  Recall 
from  calculus  that 

tn,  \  _  /(«  -  h)  -  2 f(a)  +  f(a  +  h) 

J  W  ~  > 

where  h  is  a  small  increment.  This  approximation  is  also  derived  in  Chap¬ 
ter  6. 

2.  Let  u  =  u(x,t)  satisfy  the  heat  flow  model 


'U't  -  kltXXl  0  X  <C  /,  t  0, 

u(0,  t)  =  u(l,  t)  =  0,  t  >  0, 
u(x,  0)  =  u$(x),  0  <  x  <  l. 


Show  that 


u(x, 


t)2dx 


< 


uo(x)2dx , 


t  >  0. 


Hint:  Let  E(t)  =  u(x,t)2dx  and  show  that  E'(t)  <  0.  What  can  be  said 
about  u(x,t)  if  uq(x)  =  0? 


3.  Show  that  the  problem 


Uf 

u{  0,  t) 
u(x ,  0) 


JXXl 


ku: 
g(t), 
uq(x), 


0  <  x  <  /,  t  >  0, 
u(l,  t)  =  h(t),  t  >  0, 
0  <  x  <  /, 


with  nonhomogeneous  boundary  conditions  can  be  transformed  into  a  prob¬ 
lem  with  homogeneous  boundary  conditions.  Hint:  Introduce  a  new  depen¬ 
dent  variable  w  via  w(x,t)  =  u{x,t)  —  L(x,t)  by  subtracting  from  u  a 
linear  function  L{x,t)  of  x  that  satisfies  the  boundary  conditions  at  any 
fixed  time  t.  In  the  transformed  problem  for  re,  observe  that  the  PDE  picks 
up  a  source  term,  so  you  are  really  trading  boundary  conditions  for  source 
terms. 

4.  Consider  the  heat  flow  problem 


ut  =  kuxx  —  hu ,  0  <  x  <  1,  t  >  0, 
u(0,t)  =  u(l,t)  =  1,  t  >  0, 
u(x,  0)  =  g(x),  0  <  x  <  1, 

where  k,  h  >  0.  Find  the  steady  state  solution  and  describe  precisely  where 
heat  is  flowing  in  this  system. 
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5.  Show  that  the  ad vect ion-diffusion-decay  equation 

u+  =  Duxx  —  cux  —  Xu 

can  be  transformed  into  the  diffusion  equation  by  a  transformation  of  the 
form 

u(x,t)  =  w(x1t)eax~^t . 

Solution:  Take  a  =  c/ (2D),  /3  =  A  +  c2/ (4D). 

6.  Heat  flow  in  a  metal  rod  with  a  unit  internal  heat  source  is  governed  by 
the  problem 

ut  =  kuxx  +  1,  0  <  x  <  1,  t  >  0, 
i^(0,  t)  =  0,  ^(l,t)  =  l,  t  >  0. 

What  will  be  the  steady-state  temperature  in  the  bar  after  a  long  time? 
Does  it  matter  that  no  initial  condition  is  given? 

7.  A  bar  loses  heat  across  its  lateral  boundary  at  a  rate  proportional  to  the 
temperature  u.  The  equation  is 

ut  =  kuxx  —  au ,  0  <  x  <  1,  t  >  0, 

^(0,  t)  =  1,  u(l,t)  =  1,  t  >  0. 

Find  and  plot  the  steady-state  temperature  distribution  in  the  bar  and 
analyze  how  heat  is  flowing  in  the  bar  and  through  its  boundaries. 

8.  Bacteria  in  a  one-dimensional  medium  (a  tube  of  unit  cross-sectional  area, 
length  /,  and  capped  on  both  ends)  have  a  growth  rate  given  by  the  logis¬ 
tic  law  ru{  1  —  ujK\  where  r  is  the  growth  constant,  K  is  the  carrying 
capacity,  and  u  =  u(x ,  t)  is  the  density  of  bacteria  measured  in  bacteria 
per  unit  length.  Initially,  the  density  is  given  by  u  =  ax{l  —  x).  For  t  >  0 
the  bacteria  also  diffuse,  with  diffusion  constant  D.  Formulate  an  initial 
boundary  value  problem  for  the  density.  Use  intuition  to  sketch  a  sequence 
of  density  profiles  showing  how  the  density  evolves  in  time.  You  may  want 
to  consider  the  cases  al 2  <  4 K  and  al 2  >  4 K  separately.  Hint:  Think  about 
the  problem  if  no  diffusion  were  present. 

9.  Show  that  the  equation  ut  =  k{t)uxx  can  be  transformed  into  the  diffusion 
equation  by  changing  the  independent  variable  time  t  to  r  =  f(f}  k(r])dr]. 
Show  that  the  equation  ut  =  kuxx  —  b(t)ux  can  be  transformed  into  the 
diffusion  equation  by  changing  the  spatial  variable  to  £  =  x  —  Jo  b(v)drj- 
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10.  Burgers  equation  is  a  model  for  diffusion  and  nonlinear  advection,  and  it 
is  given  by 

Uf  =  Duxx  H-  • 

What  is  the  flux?  Show  that  Burgers  equation  can  be  transformed  into  the 
diffusion  equation  vt  =  vxx  using  the  Cole-Hopf  transformation  u  =  pjx, 
ip  =  —2D  In  v. 

11.  Derive  the  basic  conservation  law  for  diffusion  with  no  sources  if  the  cross- 
sectional  area  A  =  A(x)  of  the  tube  is  a  function  of  x.  Assume  the  diffu- 
sivity  D  is  constant  and  A{pc)  has  on  a  small  variation  along  the  length  of 
the  tube. 

12.  A  metal  bar  of  constant  cross-section  and  length  l  is  composed  of  two 
different  materials  in  perfect  contact.  In  0  <  x  <  xq  the  physical  parameters 
are  pi,  C\ ,  K\,  and  in  xo  <  x  <  l  the  physical  parameters  are  p2,  C2, 

(a)  At  the  contact  point  xo  state  specifically  why  the  temperature  and  the 
flux  must  be  continuous,  (b)  If  the  left  end  of  the  bar  x  =  0  is  maintained 
at  Tq  degrees,  and  the  right  end  x  =  l  is  maintained  at  Ti  degrees,  what  is 
the  steady  state  temperature  in  the  bar? 


1.4  Diffusion  and  Randomness 

The  notion  of  diffusion  was  introduced  in  Section  1.3,  mostly  in  the  context  of 
heat  conduction.  In  this  section  we  investigate  the  connection  between  diffusion 
and  random  motion  in  general  settings,  including  biological  settings.  Later,  in 
Chapter  5,  a  broader  treatment  of  life  science  models  is  offered. 

In  chemistry  and  physics  it  is  easy  to  deduce,  reasoning  at  an  atomic  or 
molecular  level,  how  substances  diffuse  because  of  random  motion.  This  micro¬ 
scopic  description  of  diffusion  is  based  on  statistics  and  the  fact  that  atoms 
or  molecules  collide  randomly.  These  random  collisions  cause  an  assemblage 
of  molecules  to  move  from  regions  of  high  concentrations  to  regions  of  lower 
concentrations.  For  example,  if  a  noxious  gas  is  released  at  a  point  in  a  room, 
the  colliding  molecules  of  gas  and  the  air  lead  to  the  eventual  dispersion  of  the 
gas  throughout  the  room.  This  stochastic,  molecular  model  is  the  basis  of  the 
kinetic  theory  of  gases,  and  it  has  a  firm  root  in  mathematics  and  in  empirics. 
In  chemically  based  biological  systems  where  we  are  tracking  molecular  con¬ 
centrations,  we  observe  the  same  phenomenon.  For  example,  a  chemical,  toxic 
to  fish,  might  be  dumped  locally  in  a  reservoir;  the  spread  of  the  chemical 
throughout  the  reservoir  can  be  understood  by  the  molecular  collision  model. 
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In  other  biological  systems,  however,  the  same  spreading  effects  are  observed, 
but  the  cause  is  not  random  collisions  on  a  microscopic  scale,  but  interactions 
on  a  macroscopic  scale.  This  effect  is  illustrated  by  the  spread  of  an  infectious 
disease.  If  infected  individuals  are  introduced  in  a  large  population  of  suscep¬ 
tible  individuals,  then  the  infection  can  spread  throughout  the  population  by 
personal  contact  or  by  respiration  of  air  born  particulates.  Even  in  clumped 
populations  of  organisms,  there  is  a  tendency  for  individuals  to  migrate  from 
the  high  density  regions  to  low  density  regions;  this  macroscopic  dispersion  is 
the  result  of  population  ‘pressures’  like  competition  for  resources  or  might,  in 
humans,  even  have  some  psychological  basis.  But  the  observation  is  the  same 
as  in  the  molecular  collision  model-movement  from  high  to  low  densities. 

To  elaborate  further,  physical  science  models  are  usually  based  on  scientific 
principles  and  laws  that  can  be  expressed  precisely  in  terms  of  nearly  exactly 
measurable  quantities;  the  goal  is  to  obtain  precise  quantitative  results.  For 
example,  there  is  little  question  that  Newton’s  laws  of  motion  or  Maxwell’s 
equations  of  electrodynamics  might  apply  to  some  physical  situation.  In  some 
areas  of  the  life  sciences,  however,  it  is  really  impossible  to  model  all  of 
the  intricacies  of  predator-prey  interactions,  tumor  growth  dynamics,  or  the 
spread  of  an  infectious  disease.  Therefore  the  models  are  more  phenomeno¬ 
logical  in  nature,  predicting  only  qualitative  features  rather  than  leading  to 
detailed  quantitative  results.  Biological  systems  are  highly  complex,  and  often 
we  attempt  to  describe  only  the  gross  features,  ignoring  much  of  the  fine  detail, 
the  stochasticity,  and  the  inherent  variability. 

We  begin  with  a  simple  thought  experiment  that  illustrates  how  diffusion 
and  advection  arise  naturally  in  population  dynamics.  Imagine  that  we  want 
to  study  the  dynamics  of  a  large  population  of  organism  in  a  one-dimensional 
linear  domain  length  L.  For  example,  we  could  equally  focus  on  bacteria,  cells, 
fish,  or  other  organisms,  and  regard  the  linear  domain  as  a  canal  or  waterway. 
From  elementary  differential  equations  we  recall  that  a  population  can  be  mod¬ 
eled  by  a  growth  law,  or  differential  equation,  for  the  population  u  =  u(t)  of 
organisms  at  time  t,  uniformly  over  its  spatial  domain.  Thus,  u  is  actually  a 
population  density.  When  resources  are  limited  and  there  is  competition  for 
those  resources,  and  the  population  is  often  modeled  by  the  logistic  model 


du(t) 

dt 


ru(t ) 


where  K  is  the  carrying  capacity  and  r  is  the  intrinsic  growth  rate.  The  solution 
to  this  model  equation  is  easily  obtained  by  separation  of  variables,  and  it  plots 
as  the  familiar  sigmoid,  or  S-shaped,  curve;  the  population  u(t)  grows  and 
approaches  the  environmental  capacity  K\  the  growth  rate  approaches  zero 
and  growth  is  limited. 


40 


1.  The  Physical  Origins  of  Partial  Differential  Equations 


Clearly  this  model  ignores  how  the  organisms  are  distributed  spatially  in 
the  domain.  If  we  want  to  consider  spatial  distribution,  then  the  population  u 
will  depend  not  only  on  time  t  but  also  upon  the  spatial  variable  x  denoting 
its  position  in  the  domain,  where  0  <  x  <  L;  that  is,  u  =  u(x,t).  With  this 
interpretation,  u(x,  t )  is  a  local  population  density  measuring  the  number  of 
organisms  in  a  small  volume  AAx  lying  between  x  and  r  +  Ar.  In  other  words, 
u(x,  t)AAx  is  the  number  of  organisms  in  the  small  volume.  Implicit  in  this 
model  is  that  there  are  enough  organisms  in  the  domain  so  that  defining  a 
local  density  makes  sense.  To  model  population  changes,  we  must  ask  how  it 
evolves  in  both  time  and  space;  therefore  we  expect  a  PDE  model  for  u(x,  t).  If 
we  adopt  a  growth  model,  then  we  must  include  terms  (say,  containing  spatial 
partial  derivatives)  that  indicate  how  the  organisms  move  in  the  domain.  For 
example,  if  the  motion  is  due  to  diffusion,  then  the  PDE  model  with  logistic 
growth  is,  as  may  be  expected, 


Ut 


Duxx  +  ru 


u  =  u(x,  t). 


This  equation  is  called  Fisher’s  equation,  and  it  is  one  of  the  fundamental 
nonlinear  PDEs  of  biology. 

Knowledge  of  the  population  density  u(x,  t)  permits  calculation  of  the  total 
population  in  the  domain  at  time  t  via 


Total  population  at  time  t 


-L 


u(x ,  t)Adx. 


This  discussion  indicates  how  consideration  of  spatial  structure  leads  to  a 
PDE  model.  There  are  other  types  of  structure  besides  spatial  structure.  If  we 
are  interested  in  the  age  of  the  organisms,  then  the  age  density  depends  upon 
the  age  a  and  time  t ;  hence,  u  =  u(a ,  t),  where  u(a ,  t)Aa  represents  the  number 
of  individuals  at  time  t  between  the  ages  a  and  a  +  A  a.  An  important  exercise 
in  life  history  theory  is  to  develop  models  that  predict  the  age  structure  u(a,  t ) 
of  a  population  at  any  time  t  if  the  initial  age  structure  u(a,  0)  is  known  and 
the  birth  and  death  rates  are  given.  These  problems  are  discussed  in  Chapter  5. 
Other  structured  models  might  include  other  tags  or  labels  like  size  or  weight; 
these  are  called  physiologically  structured  models.  A  population  model  that 
includes  both  age  a  and  weight  w  leads  to  a  PDE  model  for  a  function  of 
three  variables,  u  =  u(a,w,t)-  So  the  population  density  depends  upon  the 
organism’s  age,  its  length,  and  time.  Other  PDE  models  in  the  life  sciences 
include  models  for  biochemical  reactions,  transport  of  chemical  species  in  the 
blood,  models  for  tumor  growth,  and  so  on. 

As  in  heat  flow,  there  are  two  fundamental  models  in  biology  that  describe 
transport,  or  motion:  advection  and  diffusion.  We  encountered  these  pro¬ 
cesses  in  Sections  1.2  and  1.3.  In  the  life  sciences,  advection  refers  to  the 
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transport  of  particles,  chemicals,  animals,  or  whatever,  via  bulk  motion  of  the 
supporting  medium,  e.g.,  wind,  water,  blood,  and  so  on.  Migration  of  a  pop¬ 
ulation  is  also  advection.  Diffusion  refers  to  the  random  motion  of  particles, 
chemicals,  animals,  or  whatever,  that  cause  these  entities  to  disperse  from  high 
densities  to  low  densities,  as  discussed  above.  Figure  1.11  compares  time  pro¬ 
files  of  both  diffusion  and  advection  processes,  and  a  combination  of  the  two. 


t  =  T 

x 


Figure  1.11  Spatial  density  profiles  showing  how  an  initial  profile  propagates 
in  time  under  an  advection,  diffusion,  and  an  ad vect ion-diffusion  process 


A  model  of  advection  follows  from  the  conservation  law  derived  in  Sec¬ 
tion  1.2.  To  reiterate,  the  basic  conservation  law  in  one  dimension  relates  the 
time  derivative  of  the  unknown  density  u  =  u(x,t),  the  spatial  derivative  of 
the  unknown  flux  </>  =  </>(#,£),  and  the  source  (or  sink)  /  =  /(ayt).1  It  is  given 
by  equation  (1.8)  as 

Ut  =  —4>x  +  /•  (1-27) 

The  density  can  refer  to  density  of  animals,  chemicals,  or  any  particles  or 
propagules.  The  flux,  a  measure  of  the  flow  rate,  is  usually  dependent  upon  the 
density  through  the  assumption  of  a  constitutive  equation;  different  motions 

Recall  that  the  flux  and  source  may  depend  on  the  density  u  as  well. 
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arise  from  different  assumptions.  As  we  noted,  advection  is  defined  by  the 
relation 

<f>  =  cu, 

where  c  is  the  bulk  speed  of  the  medium  carrying  the  density,  and  diffusion  is 
defined  by  Fick’s  law 

<t>  =  ~Dux, 

where  D  is  the  diffusion  coefficient.  The  diffusion  coefficient  has  dimensions  of 
length-squared  per  time,  and  we  often  use  the  relation  D  =  L2 /T  to  estimate 
the  time  T  that  it  takes  for  a  substance  to  diffuse  a  length  L.  If  c  and  D  are 
constant,  we  can  substitute  these  defining  relations  into  (1.27),  respectively,  to 
obtain  the  fundamental  balance  equations: 

ut  =  —cux  +  /.  (advection  equation  with  source) 

Ut  =  Duxx  +  /.  (diffusion  equation  with  source) 

If  both  advection  and  diffusion  occur,  then  the  flux  is 

<j)  =  —Dux  +  cu, 

and  the  fundamental  balance  law  becomes 

ut  =  Duxx  —  cux  +  /  (advection- diffusion  with  a  source) 

Sources  and  sinks  depend  upon  the  model.  For  population  models  the  source 
term  /  represents  birth  or  growth  rates,  and  a  sink  represents  a  death  rate, 
either  natural  or  by  predation.  For  chemical  problems,  the  source/sink  term  is 
called  a  reaction  term  and  represents  the  rate  that  the  chemical  species  is  cre¬ 
ated  or  consumed  by  chemical  reaction,  or  lost  by  absorption  (say,  across  a  cell 
boundary).  Diffusion  equations  with  source  terms  are  usually  called  reaction- 
diffusion  equations.  Sources  are  positive  terms,  and  sinks  are  negative  terms. 
In  addition,  PDEs  are  always  accompanied  a  relevant  domain  of  space  and 
time  and  by  initial  conditions  and/or  boundary  conditions;  an  initial  condition 
prescribes  the  initial  density  distribution  u(x,  0),  and  a  boundary  condition 
prescribes  what  is  going  on  at  the  boundary  of  the  domain.  In  the  latter  case 
we  may  specify  the  density  u  itself  (a  Dirichlet  condition),  or  we  may  specify 
the  flux  cj)  (a  Neumann  condition).  It  is  important  to  notice  that  the  flux  has 
a  different  form  for  the  three  models  defined  above:  advection,  diffusion,  and 
advection- diffusion. 

In  some  biological  processes  the  advection  speed  c  and  the  diffusion  constant 
may  depend  upon  location  in  the  medium,  and  thus  depend  upon  x,  or  they 
may  depend  upon  the  density  u  itself.  That  is,  c  =  c(x,u)  and  D  =  D(x,u). 
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Dependence  upon  x  signifies  a  heterogeneous  medium  of  propagation.  There¬ 
fore,  the  ad vect ion-diffusion  equation  with  a  source  is 


'U't  —  ( cu)x  +  f  7 

and  the  c  and  D  factors  may  not  be  pulled  out  of  the  spatial  derivatives.  When 
c  or  D  depends  upon  u ,  the  equation  is  nonlinear.  The  source  term  /  can  also 
introduce  nonlinearities. 


Random  Motion 


Now  we  want  to  show  how  this  diffusion  model  arises  from  a  stochastic 
argument  based  upon  random  motion.  Let  X  be  a  random  variable  with  a 
normal  probability  density  function  (the  “bell-shaped  curve” ) 


N(x,  /i,  a) 


1  c~Q-/l)2 /2cr2 


where  fi  is  the  mean,  or  center  of  the  density,  and  a  is  the  standard  deviation, 
a  measure  of  the  spread  about  the  mean.  By  definition,  the  area  under  the 
normal  curve,  N(x,  //,  cr)dx,  is  the  probability  of  X  taking  a  value  between 
a  and  b.  Amazingly  enough,  if  we  take  the  standard  deviation  to  depend  upon 
time  via  a  =  y/2 Dt,  so  that  the  spread  of  the  data  gets  larger  with  time,  then 
we  obtain  the  function 


u(x,  t) 


1  c-(x-n)2 /ADt 

y/far  Dt 


which  is  a  solution  to  the  diffusion  equation!  This  formula  gives  the  solution 
to  the  diffusion  equation  generated  by  a  unit  point  source  given  initially  at 
x  =  /i,  and  it  is  called  the  fundamental  solution  to  the  diffusion  equation,  or 
the  point-source  solution.  That  is,  the  fundamental  solution  is  the  formula 
for  the  time-changing,  spreading  density  distribution  of  a  substance  caused  by 
placing  a  unit  amount  of  the  substance  at  x  =  /x  at  time  t  =  0. 

It  is  not  an  accident  that  there  is  such  a  close  relationship  between  the 
diffusion  equation  and  probabilities.  We  look  in  a  different  direction  to  affirm 
this  relationship. 

Let  x  represent  an  arbitrary  point  on  the  x-axis.  Now  divide  the  axis  into 
equal  segments  of  length  h  so  that  the  entire  axis  is  composed  of  segments  with 
discrete  endpoints  . . . ,  x  —  2 h,  x  —  h,  x,  x  +  h,  x  +  2 h, ....  Let  u  =  u(x,  t)h 
be  the  number  of  particles  (e.g.,  cells,  organisms,  molecules)  in  the  interval 
(x,x  +  h)  at  time  t.  Figure  1.12  is  a  schematic  of  a  typical  histogram  showing 
the  distribution  of  the  assemblage  of  particles  at  time  t.  Now  assume  in  the  next 
small  instant  of  time  r  that  all  the  particles  in  each  interval  move  randomly 
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Figure  1.12  Histogram  where  u(x,t)h  represents  the  number  of  particles  in 
the  interval  (x,  x  +  h)  at  time  t 


to  the  right  or  to  the  left  with  an  equal  probability  of  one-half.  Then  we  can 
calculate  the  particle  distribution  at  the  next  time  t  +  r.  We  get 

Change  in  number  of  particles  in  (x,x  +  h) 

=  number  that  move  in  from  the  left 
+  number  that  move  in  from  the  right 

—  number  that  leave  to  the  left 

—  number  that  leave  to  the  right. 


Here,  by  c number’  we  mean  ‘average  number’.  In  symbols,  after  cancelling  the 
common  factor  h, 

1  1  11 

u(x,  t  +  r)  —  u(x,  t )  =  -u(x  —  h,  t)  +  ~u(x  +  h,  t)  —  —u{x,  t )  —  —u(x,  t). 


This  simplifies  to 


1  1 

u(x,  t  +  r)  =  —u(x  —  h,  t)  H — u(x  +  h,  t). 

2  2 


This  is  a  difference  equation  relating  the  number  of  particles  in  three  adjacent 
intervals  at  two  different  times.  Because  h  and  r  are  small,  we  may  expand  the 
three  terms  of  this  equation  in  Taylor  series  to  obtain 

u(x,  t)  +  ut(x,  t)r  +  0(r2)  =  1  (u{x,  t)  -  ux(x,  t)h  +  ^uxx(x,  t)h 2  +  O (h3) 

+  1  fu(x,  t)  +  ux(x,  t)h  +  ^uxx(x,  t)h 2  +  O (ft3)  )  , 


where  O (r2)  denotes  remaining  terms  of  the  series,  all  of  which  have  a  factor 
of  at  least  r2,  and  O (h3)  denotes  remaining  terms  which  have  a  factor  of  at 
least  h3.2  Simplifying  gives 

2  Terms  like  O (h3)  are  called  order  terms  and  we  read  such  terms  as  an  order  h 3 
term.  Precisely,  an  expression  G(h)  is  order  ha  for  small  h  if  G(h)/ha  remains 

bounded  for  all  h  sufficiently  small,  /i  /  0. 
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h 2  i 

ut(x,  t)  =  —uxx(x,  t )  +  0(r )  +  -0(ft,3). 

ZT  T 

Now  we  take  the  limit  as  r  -G  0  and  h  0  in  such  a  way  that  we  maintain  the 

i  2 

ratio  equal  to  a  hxed  constant  D.  Therefore,  in  this  special  limit,  we  obtain 

utOM)  =  Duxx(x,t), 

which  is  the  diffusion  equation.  (An  exercise  requests  this  same  derivation 
when  the  probability  of  moving  right  or  left  is  p  and  1  —  p,  respectively.  One 
obtains  the  diffusion  equation  with  drift ,  i.e. ,  the  advection-diffusion  equation). 
A  detailed  discussion  of  the  connection  between  diffusion  and  random  motion 
can  be  found  in  Murray  (2003). 

Point-Source  Solutions 

Point-source  solutions  to  linear  diffusion  equations  play  a  special  role  in 
both  applications  and  theory  of  PDEs,  and  so  we  make  a  few  further  comments 
and  catalog  some  other  such  solutions.  A  point-source  solution  are  also  called 
fundamental  solutions,  and  those  that  also  satisfy  given  boundary  conditions 
are  called  Green’s  functions.  As  we  observe  later  in  Chapter  2,  knowledge  of 
the  Green’s  function  for  a  PDE  allows  us  to  solve  the  PDE  with  an  arbitrary 
nonhomogeneous,  or  source  term,  by  simple  superposition.  This  theme  appears 
throughout  the  book. 

For  the  present  however,  we  only  illustrate  some  fundamental  solutions  to 
the  basic  diffusion- type  equations  we  encountered  so  far.  In  one-dimensional 
linear  geometry  we  repeat  our  basic  result.  The  fundamental  solution  to  the 
diffusion  equation 


'U't  —  DUxxi 

x  G  R,  t  >  0, 

(1.28) 

G(x,  /i,  t)  = 

1  p-(x-n)2 /ADt 

(1.29) 

\J  4nDt 

In  an  advection-diffusion  or  growth-diffusion  process  is  present,  we  can  also 
obtain  special  solutions  that  represent  the  density  distribution  arising  from  an 
initial,  unit  point  source  at  x  =  (i.  For  the  advection-diffusion  equation 

itt  —  D^xx  ciixi  x  G  M,  t  0,  (1.30) 


the  fundamental  solution  is 


u  =  G(x  —  ct ,  /i,  t) 


_  — ( x  —  ii—ct)2  /  ADt 

\/  4tt  Dt 


while  for  the  growth-diffusion  equation 


(1.31) 


ut  =  Duxx  +7 u,  x  G  R,  t  >  0, 


(1.32) 
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the  fundamental  solution  is 


u  =  e7tG(  x,n,t)  =  e-G-M)2/4  Dt+1t. 

V  47t  Dt 

Here  is  a  example  of  an  application  in  biology. 


(1.33) 


Example  1.21 


(Invasion  speed)  Consider  a  very  long  canal  with  still  water  where  a  non- 
indigenous  species  of  algae  is  accidentally  released  at  the  point  x  =  0.  After 
a  long  period  of  time  it  is  observed  that  the  speed  of  the  biological  invasion 
outward  in  both  directions  from  x  =  0  is  nearly  constant.  We  show  that  this 
observation  can  be  explained  by  a  diffusion-growth  model.  The  point-source 
solution  to  diffusion-growth  model  is  given  above.  Let  x  =  Xf(t)  denote  the 
position  of  the  wavefront,  defined  by  the  position  where  the  population  density 
is  u  =  Uf ,  where  Uf  is  a  small,  given  value  of  the  density.  Then,  from  the 
fundamental  solution 


1  c~Xf(t)2/4:Dt+'yt 

y/4 7rDt 


Uf. 


Taking  logarithms  gives 


In  (a/47 rDtUf). 


Now  we  make  a  clever  approximation.  For  large  times  t  the  right  side  gets  large, 
but  not  as  large  as  the  yt  term  (recall  that  t  is  much  larger  than  lnt).  Therefore, 
the  only  two  terms  in  this  equation  that  can  balance  for  large  t  are  the  two  on 
the  left  side.  Thus  the  long  time,  approximate  position  of  the  invasion  front  is 


Xf(t )  ~  yj A'yDt. 


Consequently,  the  speed  of  the  front  approaches  a  constant  value  y/A^D.  □ 


The  diffusion  equation  in  two  dimensions  with  radial  symmetry,  where 
the  density  u  =  u(r,  t)  depends  only  of  the  distance  r  from  the  origin  and  time, 
is  (see  Exercise  12) 


1 

Ut  =  D  —  (rur)r. 
r 

The  fundamental  solution,  with  the  unit  source  at  the  origin, 


(1.34) 


u{r ,  t) 


____  —  r 2 /ADt 

4nDt 
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In  three  dimensions,  the  diffusion  equation  with  spherical  symmetry,  where 
u  =  u(p,  t)  depends  only  upon  the  distance  p  from  the  origin  and  time,  is 


l 

Ut  —  D  —  (p  Up)p. 
P 


(1.35) 


In  this  case  the  point-source  solution  is,  with  the  unit  source  at  the  origin, 


u(p,t) 


1 


(47r  Dt)3/2 


-p2/4Dt 


A  major  difference  between  point-source  solutions  in  linear,  radial,  and  spher¬ 
ical  symmetry  is  the  decaying,  time-dependent  amplitude  factor  appearing  in 
front  of  the  exponential  term;  the  decay  factors  are  1  /y/t,  1/t,  and  1/t3/2, 
respectively. 


EXERCISES 

1.  In  Fick’s  law  assume  the  diffusion  coefficient  D  is  a  function  of  the  density 
u ,  i.e.,  D  =  D(u).  Show  that  the  diffusion  equation  without  sources  can  be 
written 

Ut  —  D{u)uxx  ~\~  D  (iPjux. 

2.  From  the  diffusion  equation  with  no  sources,  find  the  steady-state  concen¬ 
tration  w  =  w(x)  of  a  chemical  in  a  tube  of  length  L  =  2  with  diffusion 
coefficient  D  and  subject  to  boundary  conditions  u(0)  =  4  and  u'{ 2)  =  1. 
Explain  these  conditions  in  terms  of  chemical  concentrations  and  fluxes. 
Under  the  same  boundary  conditions,  what  is  the  steady-state  concentra¬ 
tion  if  D  depends  on  x,  e.g.,  D  =  (1-f-x)-1?  What  is  the  steady  state 
solution  if  D  depends  on  u ,  e.g.,  D  =  ul 

3.  Considering  all  cases,  find  the  form  of  steady- state,  or  time-independent, 
solutions  to  the  advection-diffusion  equation 

Ut  =  Duxx  —  cux 

and  the  advection-diffusion-growth  equation 

Ut  =  Duxx  —  cux  +  ru. 

4.  An  environmentally  toxic  radioactive  chemical  is  continually  released  at  a 
constant  rate  of  1  (mg  per  vol  per  time)  at  the  midpoint  of  a  canal  of  length 
L  with  still  water.  As  it  diffuses  through  the  canal  with  diffusion  constant 
D  —  1,  it  decays  at  rate  A  (per  unit  time)  The  ends  of  the  canal  are  con¬ 
nected  to  large  bodies  of  toxic- free  water.  Set  up  the  model  equations  and 
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the  boundary  conditions.  Find  the  steady-state  concentration  and  sketch 
its  spatial  profile  for  different  values  of  L  and  A.  Hint:  Break  the  problem 
up  into  two  parts,  on  each  side  of  the  source.  At  the  source  the  concentra¬ 
tion  must  be  continuous,  and  in  a  small  interval  about  the  source,  the  ‘flux 
in’  minus  the  ‘flux  out’  equals  one. 

5.  Work  the  preceding  problem  if  there  is  no  decay  and  if  the  source  is  located 
at  x  =  £,  0  <  £  <  L. 

6.  Consider  the  ad vect ion-diffusion  equation  on  an  interval  0  <  x  <  L.  Show 
that  if  the  flux  at  x  =  0  equals  the  flux  at  x  =  L,  then  the  density  is 
constant. 

7.  The  population  density  of  zooplankton  in  a  deep  lake  varies  as  a  function 
of  depth  x  >  0  and  time  t  (x  =  0  is  the  surface).  Zooplankton  diffuse 
vertically  with  diffusion  constant  D  and  buoyancy  effects  cause  them  to 
migrate  toward  the  surface  with  an  advection  speed  of  ag,  where  g  is  the 
acceleration  due  to  gravity.  Ignore  birth  and  death  rates.  Find  a  PDE  model 
for  the  population  density  of  zooplankton  in  the  lake,  along  with  the  the 
appropriate  boundary  conditions  at  x  =  0  and  x  =  Too.  Find  the  steady- 
state  population  density  for  zooplankton  as  a  function  of  depth,  and  sketch 
its  graph.  Note  that  the  flux  is  advective  and  diffusive. 

8.  Sucrose  in  water  has  a  diffusion  coefficient  of  4.6  x  10-6  cm2  /  sec .  Estimate 
how  far  a  concentrated  amount  of  sucrose  would  diffuse  in  1  day?  A  certain 
insect  pheromone  diffuses  in  air  with  diffusion  constant  2.0  x  10“ 1  cm2  /  sec . 
Estimate  the  time  it  takes  for  the  pheromone  to  diffuse  a  distance  of  100  m. 
Can  diffusion  alone  account  for  sexual  attraction  of  mates  for  this  insect? 

9.  Show  that  the  steady-state  solutions  to  the  diffusion  equation  in  linear 
geometry,  in  planar  geometry  with  radial  symmetry,  and  in  spatial  geom¬ 
etry  with  spherical  symmetry  are,  respectively, 

u  =  ax  +  6, 

u  =  a  In  r  +  6, 
a 

u  —  —  T  b. 

P 

Here,  a  and  b  are  arbitrary  constants. 

10.  Repeat  the  random  walk  derivation  in  this  section  in  the  case  that  the 
probability  of  moving  right  or  left  is  p  and  1  —  p,  respectively,  rather  than 
one-half  in  either  direction.  Show  that  under  appropriate  limit  definitions 
as  h  — >  0  and  r  0,  one  obtains  the  advection- diffusion  equation. 
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11.  Muskrats  were  accidentally  introduced  in  Europe  in  1905  and  the  biological 
invasion  spread  approximately  radially  in  all  directions.  Let  Uf  define  a 
predetermined  magnitude  of  the  population  density  on  a  circle  of  radius 
r  =  r f{t)  as  the  front  spreads.  Use  a  diffusion-growth  model  to  show  that 
the  speed  of  the  wave  front  is  approximately  constant  for  large  times  t. 


12.  (a)  Derive  the  radially  symmetric  diffusion  equation 


Ut 


D  —  (rur) 
r 


r 


from  basic  principles  using  the  following  strategy.  Take  two  circles  of  radius 
r  =  a  and  r  =  b  and  write  down  the  conservation  law  in  polar  coordinates 
for  the  density  in  the  annular  region  in  between  (recall  that  an  area  element 
in  polar  coordinates  is  rdrdO  and  that  the  radial  flux  is  —Dur).  (b)  Do  the 
same  for  spherically  symmetric  diffusion 


lit 


1 


D~o  (Ap) 


P 


P 


by  considering  two  concentric  spheres. 


1.5  Vibrations  and  Acoustics 

Wave  motion  is  one  of  the  most  commonly  occurring  phenomenon  in  nature.  We 
mention  electromagnetic  waves,  water  waves,  sound  and  acoustic  waves,  and 
stress  waves  in  solids  as  typical  examples.  In  this  section  we  begin  with  a  simple 
model  to  introduce  the  concept  of  wave  motion  and  one  of  the  fundamental 
PDEs,  the  wave  equation,  that  describe  such  phenomena.  The  example  is  the 
small,  transverse  vibrations  of  a  flexible,  elastic  string,  e.g.,  a  guitar  string. 
Then  we  discuss  wave  motion  in  the  context  of  acoustics. 

Vibrations  of  a  String 

Let  us  imagine  a  taut  string  of  length  l  fastened  at  its  ends.  Its  motion  is 
described  by  a  function  u  =  u(x,t),  which  defines  the  vertical  displacement  of 
each  point  x  of  the  string  at  time  t.  Our  basic  postulate  is  that  the  displacement 
u  is  small.  Implicitly,  we  assume  that  the  motion  is  in  a  plane  and  no  element 
of  the  string  moves  horizontally — only  vertically.  At  each  instant  of  time  we 
assume  that  the  string  has  a  density  p(x,t),  with  dimensions  mass  per  unit 
length,  and  the  tension  in  the  string  is  given  by  a  function  T(x,t),  with  force 
dimensions.  By  convention,  T(x,t)  is  the  force  on  the  segment  to  the  left  of  x 
caused  by  the  portion  of  the  string  to  the  right  of  x,  and  we  assume  that  the 
tension  always  is  directed  along  the  tangent  to  the  displacement  profile  at  x. 
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Figure  1.13  Displaced  string  with  tension  forces  shown 


This  latter  assumption  implies  that  the  string  does  not  resist  bending.  A  priori, 
we  do  not  know  p  or  T.  Figure  1.13  shows  an  arbitrary  segment  of  the  string 
between  x  =  a  and  x  =  b.  We  denote  the  angle  that  the  tangent  makes  with 
the  horizontal  by  #(x,£),  and  we  observe  that 

tan  6(x,t)  =  ux{x,t). 


To  obtain  an  equation  of  motion  of  the  string,  we  appeal  to  mass  balance  and 
Newton’s  second  law.  First,  mass  balance  implies  that  the  mass  of  the  segment 
at  any  time  t  must  equal  its  mass  in  equilibrium  position  (which  we  take  to  be 
at  t  =  0).  In  symbols,  this  means 


X,  t )  y/l  +  Ux{x ,  t )2  dx 


po(x)  dx , 


where  po  is  the  density  of  the  string  in  equilibrium.  (Note  that 


a/1  +  ux(x ,  t )2  dx 


is  an  element  of  arc  length.)  But  this  equation  must  hold  for  every  segment 
a,  6],  and  therefore  we  may  equate  the  integrands  to  obtain 

p(x,t)\/ 1  +  ux(x,t)2  =  po(x).  (1.36) 


Next  let  us  apply  Newton’s  second  law,  which  states  that  the  time  rate  of 
change  of  the  total  momentum  of  the  segment  must  equal  the  net  external 
force.  We  shall  assume  that  the  only  force  acting  on  the  segment  is  tension 
caused  by  the  rest  of  the  string;  we  ignore  gravity  and  any  damping  forces — see 
Exercises  1  and  2.  Because  there  is  no  horizontal  motion  of  the  segment,  the 
horizontal  forces  must  balance,  or 


T(6,  t )  cos  0(b,  t )  =  T(a,  t )  cos0(a,  t). 
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Because  this  equation  holds  for  all  a  and  6,  we  have 

T(x,  t )  cos  Q(x,  t )  =  r(t) 

for  some  function  r.  Now,  the  time  rate  of  change  of  the  total  momentum  of 
the  segment  must  equal  the  net  vertical  force.  That  is, 

d  rb  _ 

—  y  p(x,t)ut(x,t)\/l  +  uj^tf^dx 

=  T(6,  t)  sin  Q(b,  t )  —  T(a,  t)  sin  0(a,  t). 

Using  the  result  of  equation  (1.36)  and  bringing  the  derivative  inside  the  inte¬ 
gral  sign  gives 

po(x)utt(Xi  t)dx  =  T(b,  t)  sin 6{b,  t)  —  T(a,  t)  sin^(a,  t). 


a 


We  observe  that  the  right  side  of  this  equation  can  be  written  as 

T(b,t)  cos 6(b,  t) [tan  6(b,  t)  —  tan 0(a,  £)], 


or 


r(t)(ux(b,t)  -  ux(a,  t)) 


Therefore, 


Po(x)utt(x,  t)dx  =  T(t)(ux(b ,  t)  -  ux(a,  t)) 


a 


By  the  fundamental  theorem  of  calculus  we  can  rewrite  the  right  side  as  an 
integral,  and  we  obtain 

•6  rb 

Po(x)utt(x,  t)dx  =  r(t)  /  uxx(x,  t)dx. 


a 


a 


Again  using  the  arbitrariness  of  the  interval  of  integration  [a,  6],  we  deduce 

Po(x)utt(x,t)  =  r(t)uxx(x,t),  0  <  x  <  /,  t  >  0. 

As  an  additional  constitutive  assumption,  we  assume  that  the  tension  r(t)  = 
To  =  constant,  which  is  a  good  assumption  for  small  displacements  of  the  string. 
Thus,  in  the  framework  of  the  assumptions  that  we  have  made,  we  obtain  a 
model  equation  for  the  transverse  displacements,  namely, 


—  7"oTx 


X' 


(1.37) 


If  we  introduce  the  function 


Co(x)  =  dnjpoix), 
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then  we  can  write  (1.37)  as 

Utt  —  Co(^)  xx •  (1.38) 

One  should  observe  that  Co  has  dimensions  of  speed.  It  is  called  the  wave  speed , 
and  we  shall  observe  that  it  measures  the  actual  speed  that  waves  travel  along 
the  string.  In  many  applications  the  density  of  the  string  is  a  constant,  and 
hence  Co  is  constant;  in  that  case  equation  (1.38)  becomes 

'Utt  —  (1.39) 

which  is  called  the  wave  equation;  it  is  one  of  the  fundamental  equations  in 
mathematics  and  applications. 

If  Co  is  constant,  it  is  easy  to  check  (Exercise  3)  that  u  =  F(x  —  cot)  and 
u  =  G(x  +  cot)  are  solutions  of  the  wave  equation  for  any  twice  continuously 
differentiable  functions  F  and  G;  thus  the  wave  equation  admits  both  right- 
and  left-traveling  wave  solutions.  In  fact,  we  shall  show  later  that  the  general 
solution  of  the  wave  equation  is  the  superposition  (sum)  of  left-  and  right- 
traveling  waves. 

Our  original  assumption  that  the  string  is  fastened  at  its  ends  x  =  0  and 
x  =  l  translates  into  boundary  conditions 

'iz(0,  t)  =  0,  u(l,t)  =  0,  t  >  0.  (1-40) 

To  determine  a  unique  displacement,  we  must  impose  initial  conditions  as 
well;  that  is,  we  prescribe  both  the  position  of  the  string  and  its  velocity  at 
time  t  =  0.  Symbolically, 

u(x,Q)  =  f(x),  Ut(x,  0)  =  g(x),  0<x<L  (1.41) 

In  summary,  to  determine  the  displacement  u(x,t),  we  must  solve  the  initial 
boundary  value  problem  consisting  of  the  PDE  (1.39)  subject  to  the  boundary 
conditions  (1.40)  and  initial  conditions  (1.41).  We  carry  out  this  calculation 
in  Chapter  2.  We  point  out  that  this  model  is  a  small-displacement  theory. 
A  more  complicated  nonlinear  model  is  required  if  we  want  to  describe  large 
displacements. 

One- Dimensional  Acoustics * 

Acoustics  is  the  science  of  sound  and  how  signals  are  transmitted  through 
a  medium.  We  show  that  small  disturbances,  or  sound  signals,  propagate  as 
waves  and  are  governed  by  the  wave  equation. 

We  restrict  our  attention  to  sound  waves  in  a  tube  of  cross-sectional  area  A, 
as  in  Figure  1.3,  filled  with  a  gas.  In  a  fixed  cross-section  we  assume  no  variation 
in  any  of  the  variables;  the  state  variables  will  depend  on  x,  the  distance  along 
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the  tube,  and  time  t.  Thus  our  treatment  is  strictly  one-dimensional.  The  state 
variables  are  mass  density  p  =  p(x,£),  velocity  v  =  v(x,t),  and  pressure  p  = 
p(x,t).  Here,  the  velocity  is  the  actual  velocity  of  the  air  particles  as  measured 
by  a  laboratory  observer;  the  pressure  is  the  force  per  unit  area  on  the  air  to 
the  right  of  the  section  at  x,  caused  by  the  air  to  the  left  of  the  section  at  x. 
The  starting  place  for  obtaining  equations  for  acoustics  is  the  basic  balance 

law 

Ut  +  4>x  =  / 

introduced  in  Section  1.2.  Here  u  is  a  density  (quantity  per  unit  volume),  <p  is 
the  flux  (quantity  per  unit  area  per  unit  time) ,  and  /  is  a  source  (quantity  per 
unit  volume  per  unit  time).  First,  in  gas  flow  we  must  balance  mass.  Therefore, 
take  u  —  p,  the  mass  density,  and  </>  =  pv,  which  is  the  mass  flux.  There  are  no 
sources  that  create  or  destroy  mass,  so  /  =  0.  Then  mass  balance  is  expressed 
by  the  equation 

pt  +  (pv)x  =  0-  (1-42) 

Second,  momentum  must  balance.  Therefore,  take  u  =  pv,  which  is  the  momen¬ 
tum  density  (the  same  as  the  mass  flux,  in  units  of  mass  per  area  per  time,  or 
mass  times  velocity  per  unit  volume),  and  take  =  (pv)v,  which  is  the  momen¬ 
tum  flux,  or  the  rate  at  which  momentum  flows  through  a  cross-section.  But 
now  there  is  a  source  term  /  because  pressure  gradients  across  the  tube  can 
create  a  net  force  that  can  change  momentum  (this  is  Newton’s  second  law). 
So  we  take  /  =  —  px,  and  the  momentum  balance  law  becomes 

(pv)t  +  ( pv2)x  =  -px. 

The  right  side  is  a  force  per  unit  volume,  which  is  the  same  as  a  momentum 
per  unit  volume  per  unit  time;  so  it  is  like  a  momentum  source  term.  The  term 
is  negative  because  a  negative  pressure  gradient  should  cause  motion  to  the 
right.  We  leave  it  as  an  exercise  to  show  that  the  momentum  balance  equation 
can  be  written,  using  (1.42),  as 


pvt  +  pvvx  +px  =  0. 


(1.43) 


Equations  (1.42)  and  (1.43)  are  the  governing  equations  for  gas  flow  in  a 
tube.  The  equations  are  nonlinear  and  contain  three  unknowns,  p,u,  and  p.  For 
a  third  equation  we  assume  a  constitutive  relation,  or  equation  of  state,  of 
the  form 

P  =  F(p). 


That  is,  the  pressure  is  a  function  of  density.  We  will  assume  that  F'(p)  >  0, 
or  pressure  increases  with  density.  A  typical  assumption  is  p  =  kp1 ,  where 
k  >  0,7  >  1.  (If  the  equation  of  state  should  also  depend  on  temperature,  as 
in  an  ideal  gas,  then  we  would  require  yet  another  equation:  energy  balance.) 
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Much  of  acoustical  science  deals  with  small  disturbances  in  the  gas.  This 
leads  to  a  simplified,  linear  approximation  of  equations  (1.42)-(1.43).  Let  us 
assume  that  the  gas  in  the  tube  is  at  rest,  in  a  constant  ambient  state  p  = 
po,  v  =  0,  po  =  F(po).  Then  we  make  a  small  disturbance,  say  at  one  end,  and 
let  p  =  po  +  p(ay  t)  and  v  =  v(x,  t)  be  the  resulting  density  and  velocity,  where 
p  and  v  are  small  deviations  from  the  ambient  state.  For  the  pressure  we  have, 
by  a  Taylor  expansion, 

p  =  F{p0  +  p)  =  F{p0 )  +  F\p)p  H - =  po  +  c2p  H - , 


where 

C  =  y/F'(po) 

is  called  the  sound  speed,  and  the  three  dots  denote  terms  containing  at  least 
a  p2  factor.  Note  that  c  has  velocity  units  because  it  is  the  square  root  of  a 
derivative  of  pressure  with  respect  to  density.  Substituting  all  these  expressions 
into  the  mass  and  momentum  balance  laws  (1.42)-(1.43)  yields 

{po  +  p)t  +  {{po  +  P)v)x  =  0, 

{po  +  p)vt  +  {po  +  p)vvx  +  {Po  +  c2  P  +  •  •  •  )x  =0. 

Now  we  linearize  these  two  equations  by  discarding  all  products  of  small  terms; 
the  idea  is  that  products  of  small  terms  are  smaller  than  the  small  terms  them¬ 
selves.  Therefore,  we  obtain  the  approximation 

Pt  +  PoVx  =  0,  p0vt  +  c2px  =  0.  (1-44) 

These  linearized  equations  are  called  the  acoustic  approximation  equations. 
To  repeat,  p  and  v  are  the  small  density  and  velocity  deviations  from  an  ambient 
state  po  and  v  =  0. 

Next  take  the  t-derivative  of  the  first  equation  in  (1.44)  and  the  ^-derivative 
of  the  second  equation  and  then  subtract  the  results;  we  get 

Ptt  C  Pxx  —  0* 

This  is  the  wave  equation  for  the  small  density  deviation  p.  Similarly, 

Vtt  £  VXx  —  0. 

In  other  words,  small  disturbances  in  the  gas  are  propagated  by  the  wave  equa¬ 
tion.  We  recall  that  the  wave  equation  admits  solutions  of  the  form  F{x  —  ct) 
and  G{x  +  ct),  or  right-  and  left-traveling  waves  moving  at  speed  c.  therefore, 
small  density  or  velocity  variations  can  move  to  the  right,  undistorted,  at  speed 
c.  Large-amplitude  disturbances,  for  example  caused  by  an  explosion,  do  not 
propagate  in  this  way,  and  the  full  nonlinear  theory  embodied  in  (1.42)-(1.43) 
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must  be  applied.  In  this  case  waves  can  distort,  and  discontinuous  solutions, 
called  shock  waves,  can  be  produced.  This  is  the  same  type  of  behavior  we  saw 
in  Section  1.2  for  nonlinear  advection  equations.  See  Logan  (2008,  2013)  for 
detailed  discussions. 


EXERCISES 

1.  In  the  derivation  of  the  wave  equation  we  assumed  no  gravitational  force 
on  the  string.  What  is  the  model  equation  if  gravity  is  included?  If  gravity 
acts  at  each  point  of  the  string,  derive  the  equation 

'U'tt  —  9 1 

where  g  is  the  constant  acceleration  due  to  gravity. 

2.  Repeat  the  derivation  in  this  section  when  the  vertical  motion  of  the  string 
is  retarded  by  a  damping  force  proportional  to  the  velocity  of  the  string. 

Obtain  the  damped  wave  equation 

'U'tt  —  Co(^r)  'Uxx  ku^^ 

where  k  is  the  constant  damping  coefficient. 

3.  Let  Co  be  constant.  Verify  that  u  =  F(x  —  c$t)  and  u  =  G(x  +  c^t)  are 
solutions  of  (1.39)  for  any  twice-differentiable  functions  F  and  G.  If  F(x)  = 
1/(1  +  x2),  sketch  time  profiles  of  the  wave 

1 

u(x,  t )  =  —(F(x  —  t)  +  F(pc  +  t)) 

2 

at  t  =  0,  1,  3,  5. 

4.  Consider  the  displacements  of  a  string  governed  by  the  wave  equation 

'Utt  —  ^  ^xxt  0  ^  x 

where  c  is  constant,  and  subject  to  the  boundary  conditions 

u{ 0,  t)  =  0,  u(l,t)  =  0,  t  >  0. 

Show  that  for  any  positive  integer  n  there  is  a  solution  of  the  form 

nnct  nnx 

—  cos  — - —  sin  — - — . 

6  L 

What  are  the  initial  conditions?  Describe  the  motion  of  the  string  in  the 
cases  n  =  1  and  n  =  2  (sketch  several  fixed  time  snapshots  of  the  string 


un(x,t ) 
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profile).  In  general,  for  any  n,  what  is  the  temporal  frequency  of  the  oscilla¬ 
tions  represented  by  these  solutions?  These  frequencies  are  called  the  fun¬ 
damental  frequencies,  and  the  corresponding  displacements  are  called 
the  fundamental  modes.  How  do  the  frequencies  change  as  the  length  l 
of  the  string  is  changed?  How  do  the  frequencies  depend  on  the  tension? 
Discuss  your  answers  in  the  context  of  a  vibrating  guitar  string.  These 
special  solutions  un{x,t)  are  called  standing  waves. 


5.  The  total  energy  of  the  string  governed  by  equation  (1.37)  with  boundary 
conditions  (1.40)  is  defined  by 


m 


1  2  1  2 

^Pout  +  2  T°Ux 


dx. 


Show  that  the  total  energy  is  constant  for  all  t  >  0.  Hint:  Multiply  (1.37) 
by  ut  and  note  that  (u^)t  =  2 ututt  and  ( utux)x  =  utuxx  +  utxux.  Then 
show 


d_ 

dt 


pou^dx  =  r0utux 


i 

o 


6.  If  the  equation  of  state  has  the  form  p  =  F{p)  =  kp1 ,  find  an  expression 
for  the  sound  speed  c.  Using  this  equation  of  state  with  7  =  1.4,  compute 
the  speed  that  sound  waves  travel  in  air.  Take  the  density  of  air  to  be 
0.0013  gm  per  cubic  cm. 

7.  Using  the  mass  balance  equation  (1.42),  show  that  the  momentum  balance 
equation  {pv)t  +  ( pv2)x  =  —  px  can  be  written  in  the  form  (1.43). 

8.  At  the  end  {pc  =  0)  of  a  long  tube  {x  >  0)  the  density  of  air  changes 
according  to  the  formula  p{ 0,  t)  =  1  —  cos  2 1  for  t  >  0,  and  p{ 0,  t)  =  0  for 
t  <  0.  Find  a  solution  to  the  wave  equation  in  the  domain  x>0,  —00  <  t  < 
00,  in  the  form  of  a  right-traveling  wave  that  satisfies  the  given  boundary 
condition.  Take  c  =  1  and  plot  the  solution  surface. 

9.  (Transmission  line)  In  a  long  transmission  line  the  voltage  V  =  U(x,  t)  and 
current  I  =  I{x,t)  (per  unit  length)  satisfy  the  transmission  line  equations 

Ix  +  CVt  +  GV  =  0,  Vx  +  LIt  +  RI  =  0, 


where  C  is  the  capacitance,  G  is  the  leakage,  R  is  the  resistance,  and  L  is 
the  inductance  (all  per  unit  length)  of  the  line.  Show  that  both  V  and  I 
satisfy  the  telegraph  equation 

LCutt  +  (RC  +  LG)ut  +  RGu  =  uxx. 

Note  that  if  R  =  G  =  0,  then  I  and  V  both  satisfy  the  wave  equation. 
What  is  the  speed  of  waves? 
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One  of  the  fundamental  equations  of  mathematical  physics  is  the  Schrodinger 
equation,  which  is  the  basic  equation  of  quantum  mechanics.  In  the  next  few 
paragraphs  we  give  a  brief  description  of  Schrodinger’s  equation  in  one  dimen¬ 
sion. 

Consider  a  particle  of  mass  m  moving  on  the  x-axis  under  the  influence  of 
a  continuous,  conservative  force  F(x),  which  depends  only  on  the  position  x. 
According  to  the  canon  of  classical  particle  physics,  the  motion  x  =  x(t)  of  the 
particle  is  governed  by  the  dynamical  equation 

m~d^  =  F^'  (L45) 


which  is  Newton’s  second  law  of  motion  (mass  times  acceleration  equals  force). 
If  the  initial  position  x(0)  and  initial  velocity  dx/dt  at  t  =  0  are  prescribed,  then 
one  can  in  theory  solve  the  ODE  model  (1.45)  subject  to  the  initial  conditions 
and  actually  determine  the  state  of  the  particle,  that  is,  its  position  and  velocity, 
for  all  times  t  >  0.  In  this  sense,  classical  mechanics  is  a  deterministic  model- 
knowledge  of  the  initial  state  determines  the  future  history  of  the  particle. 

In  the  early  1900s  it  was  found  that  this  classical,  deterministic  model  of 
motion  fails  on  the  atomic  scale.  Quantum  mechanics,  a  probabilistic  model, 
grew  out  of  the  atomic  physics  revolution.  Quantum  theory  dictates  that  the 
particle  has  no  definite  position  or  velocity;  rather,  one  postulates  a  statistical, 
or  probabilistic,  interpretation  of  the  state  of  the  particle  in  terms  of  a  wave 
function  T(x,  t).  The  square  of  the  wave  function,  |T|2,  is  a  probability  density 
for  the  random  position  of  the  particle;  that  is, 

nb 


T(x,  t)\2dx 


is  interpreted  as  the  probability  of  the  particle  being  in  the  interval  a  <  x  <  b 
at  time  t.  Thus  f 


oo 

-oo 


\d*(x,t)\2dx  =  1,  since  the  particle  is  located  somewhere 
on  the  x-axis.  From  elementary  probability  theory  it  is  known  that  the  prob¬ 
ability  density  |T(x,t)|2  contains  all  of  the  statistical  information  for  a  given 
problem  (for  example,  the  mean  and  variance  of  the  position).  Note  that  4/  may 
be  complex- valued,  and  thus  |T|2  =  TT,  where  the  overbar  denotes  complex 
conjugation. 

So  the  question  to  finding  the  wave  function.  The  equation  that  governs  the 
evolution  of  a  quantum-mechanical  system  (the  analogue  of  (1.45)  for  a  classical 
system)  is  the  Schrodinger  equation,  a  second-order  partial  differential  equation 
having  the  form 
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where  V  =  V(x)  is  the  potential  energy,  m  is  the  mass,  and  h  =  h/( 27t),  where 
h  =  6.625  •  ICR34  kg  m2/s  is  Planck’s  constant.  (Recall  that  associated  with  the 
force  F(x)  is  a  potential  function  V (x)  defined  by  the  equation  F(x)  =  —V'(x); 
that  is,  the  force  is  the  negative  gradient  of  the  potential.)  One  can  motivate 
the  Schrodinger  equation  from  momentum  and  energy  considerations,  but  here 
our  goal  is  only  to  note  the  form  of  the  equation.  The  reader  is  referred  to  one  of 
the  many  excellent  texts  on  quantum  mechanics  for  a  complete  discussion  (e.g., 
Griffiths  (2005)).  A  popular  equation  studied  in  detail  in  the  mathematical 
literature  is  the  free  Schrodinger  equation 

Vt  =  i^xx,  (1-47) 

where  the  potential  V  is  taken  to  be  zero  (a  free  particle  with  no  forces  acting); 
the  constants  are  taken  to  be  unity.  Note  that  this  equation  resembles  the 
diffusion  equation,  but  it  has  a  complex  coefficient;  this  simple  alteration  makes 
solutions  of  the  two  equations  quite  different. 

One  method  to  find  solutions  of  (1.46)  (we  shall  observe  later  that  this 
method  is  basic  in  PDEs)  is  to  assume  that  the  variables  separate,  i.e.,  the 
solution  has  the  form  of  a  product  T(x,£)  =  y(x)cf)(t).  Substituting  this  into 
(1.46)  gives,  after  rearrangement, 

ih<t>'(t)  _  +  V(x)y(x) 

<i>(t)  y{x) 

Here,  prime  denotes  the  derivative.  The  left  side  of  this  equation  depends  only 
on  t,  and  the  right  side  depends  only  on  x.  The  only  way  equality  can  occur 
for  all  t  and  all  x  is  if  both  sides  are  equal  to  the  same  constant,  which  we  call 
E  (for  energy).  Therefore,  we  obtain  two  equations,  one  for  </>, 

^  =  (~iE/h)<f>,  (1.48) 

and  one  for  ?/,  namely, 

-lf-v"  +  (V(x)-E)y  =  0.  (1.49) 

2  m 

The  solution  of  (1.48)  is  easily  found  to  be 

<p  =  C  exp(— iEt/h)  =  C(cos(—Et/h)  —  i  sin (Et/h)), 

where  C  is  a  constant.  Thus,  the  temporal  part  is  oscillatory  (periodic).  Equa¬ 
tion  (1.49),  whose  solution  y{x)  gives  the  spatial  part  of  the  wave  function,  is 
called  the  time-independent  Schrodinger  equation,  and  it  is  one  of  the  funda¬ 
mental  equations  of  mathematical  physics.  The  values  of  E  for  which  (1.49)  has 
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a  nontrivial  solution  y(x)  with  J^°ooy(x)2dx  =  1  are  interpreted  as  the  allow¬ 
able  energy  levels.  This  is  in  contrast  to  the  situation  in  classical  mechanics, 
where  any  energy  level  is  possible. 


EXERCISES 


1.  Show  that  the  ordinary  differential  equation  y"  +  p(x)y'  +  q(x)y  =  0  can 
be  transformed  into  the  Schrodinger-like  equation  u"  +  r(x)u  =  0,  r  = 
q  —  p' / 2  —  p2  / 4,  without  a  first  derivative,  by  the  Liouville  transformation 


2.  If  the  potential  is  V(x)  =  |x2,  then  for  a  special  value  of  E,  equation 

2 

(1.49)  admits  a  solution  of  the  form  y{pc)  =  Ce~ax  for  some  a  >  0  and 
any  constant  C.  Determine  a  and  E,  and  sketch  a  graph  of  the  probability 
density  function  |4/|2  in  the  case  k  —  m  —  K  (use  a  calculator  to  compute 
C).  Numerically  calculate  the  probability  that  the  particle  is  in  the  interval 
[0,1]. 


3.  Suppose  4/(x,£),  x  E  R,  t  >  0,  is  a  twice  continuously  differentiable 


X 


function  that  satisfies  (1.46),  and  both  |T 
that 

fOO 

|T(x,t)|2dx=  constant,  t  >  0. 


0  as  \x\ 


oo.  Show 


■oo 


Hint:  Take  the  time  derivative  of  the  left  side,  pulling  it  under  the  inte¬ 
gral;  use  the  Schrodinger  equation  and  its  complex  conjugate,  and  finally 
integrate  the  terms  like  by  parts. 


4.  A  free  particle  of  mass  m  is  confined  to  the  interval  0  <  x  <  7r,  and 
4/(0,  t)  =  4/(7r,£)  =  0  for  all  t.  Show  that  the  associated  time-independent 
problem  is 

„  2  mE 

y  +  —fi2~y  =  °>  0  <  x  <  tt;  2/(0)  =  y{ n)  =  0. 

Show  that  this  problem  has  a  nontrivial  solution  if,  and  only  if,  E  is  one 
of  the  values  n2^2/(2m),  n  =  1,2,3,....  Sketch  the  probability  density 
functions  y{x)2  corresponding  to  n  =  1  and  n  =  2,  and  compute  the 
probability  that  the  particle  is  in  the  interval  [0,  0.25]  in  each  case. 
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1.7  Heat  Conduction  in  Higher  Dimensions 

Most  physical  problems  occur  in  more  than  one  spatial  dimension.  In  this  sec¬ 
tion  we  set  up  a  PDE  model  for  heat  flow  in  three  dimensions.  The  same  ideas 
carry  over  to  diffusion  of  chemicals  or  other  biological  quantities.  This  section 
may  require  some  review  of  the  concepts  and  definitions  from  multi-variable 
calculus.  These  notions  include  the  gradient  of  a  scalar  function  the  divergence 
of  a  vector  field,  as  well  as  key  integral  theorems. 

The  idea  is  exactly  the  same  as  in  one  dimension,  namely  to  formulate  an 
integral  conservation  law.  Let  ft  be  a  region  in  space  where  heat  is  flowing, 
and  let  u  =  u{pc,  y ,  z,  t)  be  the  temperature  at  time  t  at  the  point  (x,  y ,  z)  in  ft. 
We  assume  that  the  region  is  homogeneous  and  is  characterized  by  a  constant 
specific  heat  c  and  a  constant  density  p.  Now  let  B  be  an  arbitrary  sphere 
contained  in  ft.  See  Figure  1.14. 


Figure  1.14  Arbitrary  sphere  B  contained  in  a  region  ft 


We  apply  an  energy  balance  principle  to  B ,  which  states  that  the  rate  of 
change  of  the  total  heat  energy  in  B  must  equal  the  rate  at  which  heat  flows 
into  B  across  its  boundary  plus  the  rate  at  which  heat  energy  is  generated  in  B 
by  sources.  The  total  amount  of  heat  in  a  small  volume  element  dV  =  dxdydz 
is  cpudV ,  and  thus  the  total  amount  of  heat  energy  in  B  is  given  by  the  three- 
dimensional  integral 


Total  amount  of  heat  energy  in  B 


cpu  dV. 


3  For  concise  notation,  double  and  triple  integrals  are  often  written  as  a  single  inte¬ 
gral  with  the  region  of  integration  as  a  subscript.  The  context  should  indicate  the 
type  of  integral,  two-  or  three-dimensional. 
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We  assume  that  the  heat  sources  (or  sinks)  are  given  by  a  point  function  /  = 
f(x,y,  z,£),  where  fdV  is  the  rate  at  which  heat  is  generated  in  dV ;  thus,  the 
rate  at  which  heat  is  generated  in  the  whole  of  B  is 


Rate  that  heat  energy  is  produced  in  B 


fdV. 


Note  that  /  has  dimensions  of  energy  per  unit  volume,  per  unit  time. 

Next  we  introduce  the  heat  flux  vector  0  =  4>(x,  y ,  z,  £);  its  direction  corre¬ 
sponds  to  the  direction  of  heat  flow  at  position  (x,  y ,  z)  at  time  t.  The  rate  at 
which  heat  flows  across  an  oblique  surface  element  dA  oriented  by  the  outward 
unit  normal  vector  n  is 

0  •  n  dA. 


See  Figure  1.15. 


Figure  1.15  Heat  flux  through  a  surface  element  dA  oriented  by  its  outward 
unit  normal  vector  n  is  <fi  •  n  dA 


Consequently,  the  net  rate  that  heat  energy  flows  across  the  boundary4  of 
F>,  denoted  by  dB ,  is  the  surface  integral 


n  dA. 


So,  the  conservation  law,  or  energy  balance  law,  is 


d_ 

dt 


cpu  dV 


<fi  •  n  dA  + 


(1.50) 


The  minus  sign  appears  on  the  flux  integral  because  if  the  flow  is  outward,  the 
integral  is  positive;  the  minus  sign  will  cause  the  net  energy  on  the  left  side  of 
the  equation  to  decrease,  which  is  correct. 

Next  we  apply  one  of  the  fundamental  integral  relations  from  multi- variable 
calculus — the  divergence  theorem.  It  permits  us  to  rewrite  the  flux  integral 
in  (1.50)  as  a  volume  integral.  The  divergence  theorem,  a  three-dimensional 
version  of  the  fundamental  theorem  of  calculus. 


Two  common  notations  for  the  boundary  of  a  set  B  are  dB  and  Bd  B. 


4 
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"heorem  1.22 


(Divergence  theorem)  If  the  vector  field  </>  is  continuously  differentiable  on 
the  region  B  and  continuous  on  B  U  dB ,  where  dB  is  the  boundary,  then 


div  <j)  dV 


•  n  dA. 


□ 


Using  the  divergence  theorem  we  can  write  energy  balance  equation  (1.50) 


as 


d 


—  /  cpu  dV 
dt  JB  r 


div  0  dV  +  /  fdV. 
b  J  B 

Now  we  can  bring  the  time  derivative  under  the  integral  on  the  left  side  and 
finally  rearrange  all  the  terms  under  one  volume  integral  to  obtain 


/  ( cput  +  div  cj)  —  f)  dV  =  0. 

J B 


This  balance  law  must  hold  for  every  sphere  B  in  i2,  and  therefore  the  integrand 
must  vanish,  giving  the  partial  differential  equation 


cput  +  div  </>  =  /  (1.51) 

for  all  t  and  all  (x,y,z)  E  17.  Equation  (1.51)  is  the  local  form  of  the  heat 
energy  conservation  law  in  three  dimensions. 

The  conservation  law  still  contains  two  unknowns,  the  scalar  temperature 
u  and  the  vector  heat  flux  </>.  A  constitutive  relation  can  be  postulated  to 
connect  the  two.  One  such  relation  is  the  three-dimensional  version  of  Fick’s 
law,  or  Fourier’s  heat  conduction  law,  which  states,  consistent  with  the 
laws  of  thermodynamics,  that  heat  flows  down  the  gradient.  In  symbols, 


cj)  =  —K  gradrn 


(1.52) 


Recall  from  calculus  that  the  negative  gradient  of  a  function  points  in  the 
direction  of  maximum  decrease.  The  proportionality  constant  K  is  the  thermal 
conductivity.  Substituting  (1.52)  into  (1.51)  and  using  the  identity 

div(grad)  u  =  uxx  +  uyy  -j-  uzz , 

we  obtain  the  heat  equation  for  the  temperature  u  =  u(x,  y,  x,  t)  in  three  dimen¬ 
sions: 

CpUt  K(llXx  T  ^ yy  H”  ^zz)  —  /•  (1.53) 

The  expression  uxx  +  uyy  +  uzz  is  called  the  Laplacian  of  u ,  and  it  is  denoted 
in  this  text  by  A u.  That  is5, 


A  u  = 


u 


ry*  ry~> 


T  uyy  -\-  u 


zz 


5  In  science  and  engineering  texts,  the  Laplacian  is  often  denoted  by  V2u. 
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Finally,  in  summary,  we  can  write  the  heat  conduction  equation  as 

Ut  —  kAu  =  — /,  (1.54) 

cp 

where  the  constant  k  =  K /  (cp)  is  called  the  diffusivity.  Equation  (1.54)  is 
one  of  the  fundamental  equations  of  mathematical  physics. 

Remark.  The  heat  conduction  equation  in  two  dimensions  is  exactly  the  same 
as  (1.54)  where  the  Laplacian  is  two  dimensional: 

Ail  —  uXx  T  ^yy •  (1.55) 

In  two  dimensions  the  region  17  is  a  planar  region  and  df2  is  the  region’s 
boundary,  a  curve  taken  counterclockwise.  □ 

Generally,  we  expect  that  the  three-dimensional  heat  equation  (1.54),  and 
similarly  for  the  two-dimensional  version,  will  come  with  auxiliary  conditions 
in  the  form  of  an  initial  temperature  condition 

u(x,  y,  z,  0)  =  uo(x,  y,  z)  in  17, 

as  well  as  conditions  on  the  boundary  df2. 

If  the  source  term  /  and  the  boundary  conditions  on  9 17  are  independent  of 
time,  then  we  expect  that  the  body  17  would  eventually  come  to  an  equilibrium 
temperature  u  =  u(x,  y,  z )  that  is  independent  of  time.  This  would  occur  long 
after  the  transients,  or  effects  of  the  initial  condition,  decay  away.  From  (1.54), 
because  ut  =  0,  we  infer  that  this  equilibrium  temperature  will  satisfy  the  PDE 

—kAu  =  — /, 
cp 

which  is  Poisson’s  equation.  If  there  are  no  sources,  f(x,y,z)  =  0,  then 
u  =  u(x,  y ,  z)  satisfies  Laplace’s  equation 

A^  =  0  in  17.  (1.56) 

Laplace’s  equation,  which  is  discussed  in  the  next  section  and  in  Chapter  4,  is 
one  of  the  most  famous  models  in  the  mathematical  sciences,  and  it  governs 
many  phenomena  besides  equilibrium  temperatures. 

Because  Laplace’s  equation  (1.56)  does  not  contain  time,  we  expect  that 
only  boundary  conditions  should  be  appended  in  order  to  have  a  physically 
well-posed  problem.  We  can  prescribe  the  temperature  on  the  boundary  of  17, 
that  is, 

u(x,y,z)  =  g(x,y,z),  {x,y,z)&dtt, 
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which  is  a  Dirichlet  condition;  or  we  can  prescribe  the  heat  flux  on  the 
boundary,  that  is, 


— if  grad  u  •  n  =  h(x ,  y,  z),  (x,  y,  z)  G 

which  is  a  Neumann  condition.  If  no  heat  flows  across  the  boundary,  then 
h  =  0,  and  we  say  that  the  boundary  is  insulated.  As  in  one  dimension,  we 
may  also  have  s  radiation  condition  of  the  form 

— K gT&&u  •  n  =  —  h(u  —  g),  on  df2. 

where  h  >  0  is  the  heat  loss  coefficient  and  g  =  g{pc,  y ,  z)  is  the  given  environ¬ 
mental  temperature.  An  alternate  notation  for  the  derivative  on  the  left  side  is 
gradr/,  •  n  =  which  is  called  the  normal  derivative  of  u  in  the  direction 
of  the  outward  unit  normal.  Thus  the  radiation  condition  is  often  written 

—K——  =  —h(u  —  g)1  on  dQ. 

LL I L 

For  beginning  students,  learning  the  notation  associated  with  partial  dif¬ 
ferential  equations,  especially  in  higher  dimensions,  may  be  a  difficult  task.  As 
you  use  the  notation  and  become  comfortable  with  it,  it  should  become  evident 
that  it  is  only  notation  and  not  as  difficult  as  it  appears. 


EXERCISES 

1.  Let  u  =  u(x,  y ,  z)  be  a  scalar  field.  Verify  that 

div  (grad'u)  =  uxx  +  uyy  +  uzz. 

2.  What  is  the  form  of  the  conservation  law  (1.51)  if  the  thermal  conductivity 
K  in  Fourier’s  law  (1.52)  depends  on  position,  i.e.,  K  =  K(x,  y ,  z)l  (In  this 
case  we  say  that  the  medium  is  heterogeneous.) 

3.  Suppose  u  =  u(x,  y ,  z)  is  a  solution  of  the  Neumann  problem 

-KAu  =  /,  (x,y,z)ef2, 

— iFgrad  u  •  n  =  g{pc,  y,  z),  (x,  y ,  z)  G  df2, 

where  /  and  g  are  functions  of  x,y,z.  Show  that  /  and  g  must  satisfy  the 
relation 

f  f  dV  =  f  gdA. 

J  Q  J  dQ 

In  terms  of  steady  heat  flow,  what  is  the  physical  meaning  of  this  relation? 
Hint:  Integrate  the  PDE. 
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4.  Let  w  be  a  scalar  field  and  0  a  vector  field.  Verify  the  vector  identity 

div  (rc0)  =  0  •  grad  w  +  w  div  0. 

Integrate  this  equation  over  17  and  take  0  =  grad  u ,  where  u  is  a  scalar 
field,  to  prove  Green’s  identity 


wAu dV = 


grad  u-  grad  re  dV  +  /  w  gradr/,  •  ndA 


'ar? 


5.  Show  that  if  the  Dirichlet  problem 


A u  =  A u,  (x,  y ,  z)  G  17, 
u  =  0,  (x,  2/,  z)  G  5i7, 

has  a  nontrivial  solution  u  =  u(x,  ?/,  z),  then  A  must  be  negative.  Hint: 
Multiply  the  equation  by  u  and  integrate  over  17;  use  Green’s  identity  in 
Exercise  4. 

6.  (Dirichlet’s  principle)  Suppose  that  u  satisfies  the  Dirichlet  problem 


A u  =  0,  (x,  y ,  z)  G  17, 

U  =  f,  (x,y,z)edf2. 


Show  that 


|  grad  u  | 2  dV  <  /  |  grad  w  \2  dV 

f2  J  Q 


for  any  other  function  w  that  satisfies  w  —  f  on  917.  Thus,  the  solution  to 
the  Dirichlet  problem  minimizes  the  energy 


grad  w 


2 


dV. 


Hint:  Let  w  =  u  +  v,  where  v 
using  Green’s  identity. 


0  on  d  17,  and  expand  fQ 


grad  w 


2 


dV 


7.  If  heat  is  both  diffusing  and  advecting  in  a  spatial  domain  with  flux  given 

by 

0  =  —Kgididu  +  irv, 


where  v  is  nonconstant  vector  advection  velocity,  show  that  the  governing 
advection-diffusion  equation  is  given  by 


cput  =  KAu  —  (div  v)u  —  v  •  gradro 
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1.8  Laplace’s  Equation 


In  the  last  section  we  derived  Laplace’s  equation: 

A u  =  uxx  +  uyy  =  0  (two  dimensions) 

A u  =  uxx  +  Uyy  +  uzz  =  0  (three  dimensions) 

We  observed  that  Laplace’s  equation  models  steady  heat  flow  in  a  region  where 
the  temperature  is  fixed  on  the  boundary.  Working  in  two  dimensions,  we  show 
what  Laplace’s  equation  tells  us  physically.  Let  (x,  y)  be  some  point  in  the 
region  where  heat  is  flowing  and  let  h  be  a  small  increment.  By  Taylor’s  theorem 
we  have 

1  1 

u(x  —  h,y)  =  u( x,  y)  -  hux(x,  y)  +  -h2uxx(x,  y)  -  -h3uxxx(x,  y)0(h4), 

2  o 

where  0(h4)  denotes  the  remaining  terms  (the  error).  Moreover, 

1  i 

u(x  +  h,y)  =  u(x,  y)  +  hux(x,  y)  +  -h2uxx(x,  y)  +  -h3uxxx(x,  y)0{hA). 

2  o 

Adding  these  two  expressions  and  solving  for  uxx  gives 

u(x-h,y)-2u(x,y)  +  u(x  +  h,y) 

—  j^2  T  0(h  ), 

where  O (h2)  denotes  terms  that  are  quadratic  in  h.  This  expression  on  the  right 
side  is  a  difference-quotient  approximation  to  the  second  partial  derivative  uxx 
at  (#,?/),  and  the  error  is  proportional  to  h2 .  Performing  the  same  calculation, 
but  now  incrementing  y  instead  of  x,  we  obtain  the  approximation 


u(x,y-h)-2u(x,y)+u{x,y  +  h)  2x 

Uyy(X ,  y)  =  - — -  -I-  O (ft  ). 

Therefore,  Laplace’s  equation  at  (x,  y )  can  be  approximated  by  the  equation 


A  u(x,y) 


u(x  —  h,y)  —  2u(x,  y)  +  u(x  +  h,  y) 

h 2 

+  u(x,  y  -h)  -  2 u(x,  y)  +  u(x,  y  +  h)  + 


0. 


Solving  for  u(x,  y)  gives,  approximately,  upon  neglecting  the  small-order  terms, 

1 

u(x,  y)  =  —(u(x  —  h,  y)  +  u{x  +  h,  y)  +  u(x,  y  —  h)  +  u(x,  y  -j-  h)).  (1-57) 

Equation  (1.57)  states,  physically,  that  the  temperature  at  a  point  (x,  y)  is 
approximately  the  average  of  the  temperatures  at  the  four  nearby  points 
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Figure  1.16  If  u  satisfies  Laplace’s  equation,  then  the  temperature  at  (x,  y) 
is  approximately  the  average  of  the  temperatures  at  the  four  nearby  points 


(x  —  h,  y),  {x  +  h,  y),  (x,  y  —  h),  (x,  y  +  h).  See  Figure  1.16.  Therefore,  the  tem¬ 
perature  at  (x,  y)  cannot  exceed  the  temperatures  at  the  neighboring  points; 
so  a  maximum  could  not  occur  at  (x,  y).  Similarly,  a  minimum  could  not  occur 
at  (x,  y). 

This  important  physical  interpretation  can  be  extended  to  a  circle.  If  u  — 
u(x,y)  satisfies  Laplace’s  equation  at  a  point  P  :  (xoj2/o)  in  a  region,  then  the 
temperature  at  P  is  the  average  value  of  the  temperature  on  any  circle  Cr(P) 
of  radius  R  centered  at  P  lying  in  the  region;  in  symbols, 


u(x0,yo) 


1 


2ttR 


Cr(P ) 


u(x ,  y)ds. 


The  integral  here  is  a  line  integral  along  the  curve  Cr(P),  taken  counterclock¬ 
wise.  Of  course,  this  result  generalizes  to  three  dimensions:  The  steady-state 
temperature  at  a  point  is  the  average  of  the  temperatures  over  the  surface  of 
any  sphere  centered  at  that  point. 

If  the  value  of  u  at  a  point  is  the  average  of  the  values  on  any  circle  about 
that  point,  then  the  value  of  u  at  that  point  cannot  exceed  every  value  of  u 
on  any  given  circle  (assuming  u  is  not  a  constant  function).  This  implies  that 
a  function  u  satisfying  Laplace’s  equation  in  a  given  circle  cannot  have  a  local 
maximum  or  local  minimum  at  a  point  inside  that  circle;  the  maximum,  or 
minimum,  must  therefore  occur  on  the  boundary.  Indeed,  this  even  true  for 
much  general  regions,  and  the  result  is  called  the  maximum  principle.  We  state 
the  result  precisely. 
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Theorem  1.23 

(The  maximum  principle)  Let  u(x,  y)  satisfy  Laplace’s  equation  in  17,  an 
open,  bounded,  connected  region  in  the  plane;  and  let  u  be  continuous  on  the 
closed  domain  17  U  df2  consisting  of  17  and  its  boundary.  If  u  is  not  a  constant 
function,  then  the  maximum  and  minimum  values  of  u  are  attained  on  the 
boundary  of  17  and  nowhere  inside  17.  □ 


This  result  is  also  true  in  three  dimensions.  This  is  one  of  the  fundamental 
properties  of  solutions  to  Laplace’s  equation. 

The  proof  of  the  maximum  principle  is  not  difficult  and  requires  only  a 
couple  of  concepts  from  mutivariable  calculus.  We  show  that  the  maximum 
must  occur  on  the  boundary;  the  argument  for  the  minimum  is  similar.  Let 
w(x,  y )  =  u(x,  y)  +  e  ( x 2  +  y 2) ,  where  e  >  0.  Then 

Aw  =  A u  +  4e  =  4e  >  0. 


Therefore  Aw  >  0  in  17.  From  calculus,  if  w  has  a  maximum  at  a  point  P  = 
(x,y)  in  17,  then  at  P  we  know  wxx,  wyy  <  0.  But  this  contradicts  Aw  >  0. 
Thus  the  maximum  must  occur  at  some  point  Pm  =  (#m?2/m)  on  d  17.  Then, 


u(x,  y)  =  w(x,  y)  -  e  ( x 2  +  y2)  <  w(x,  y) 

<  w(Pm )  =  u(Pm)  +  £  Pm\\2  <  u(Pm )  +  zL2 


where  L  is  the  maximum  distance  from  the  origin  to  the  boundary  of  17.  There¬ 
fore,  because  e  is  arbitrary, 


u{x,y)  <  u(PM) 


for  all  (x,  y)  in  17.  □ 

Polar  and  Spherical  Coordinates 

Many  two-dimensional  problems  involving  Laplace’s  equation  are  in  regions 
(circles,  wedges,  annuli)  that  lend  themselves  to  a  polar  description  in  terms  of 
r  and  0,  rather  than  rectangular  coordinates  x  and  y.  This  means  that  we  need 
an  expression  for  the  Laplacian  in  terms  of  polar  coordinates;  the  connection 
between  polar  and  rectangular  coordinates  is  given  by  the  familiar  formulae 

x  =  r  cos  0,  y  =  r  sin  0, 


r  =  yj x2  +  y2,  0  =  arctan  — 

x 


or,  inversely, 
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The  chain  rule  allows  us  to  calculate  the  partial  derivatives  uxx  and  uyy  in 
terms  of  derivatives  with  respect  to  r  and  6.  First,  we  calculate  du/dx ;  we  have 

du  .  f  sin  0 

—  urrx  +  uqUx  =  [cos  U)ur - uq, 


dx 


So,  an  x  partial  derivative  produces  the  operation 


r 


9  nd 

=  cos  6 


sin  9  d 


dx 


dr 


r  dO 


in  polar  coordinates.  Therefore,  the  second  partial  is 


d_ 

dx 


d  sin  6  d 
COSddr  r~d9 


—  cos2  6 


d2  2  sin  0  cos  6  d2 


dr2  r  drdO 

sin2  6  d2  2  cos  6  sin  6  d  sin2  6  d 
+  — + - -w - wv  + - 


r 


dO2 


r 


dO 


r 


dr 


The  reader  should  verify  these  calculations.  Similarly,  one  finds  that 


d_ 

dy 


d  cos  0  d 
sin  0— — h 


dr 
d2 

=  sin2  0—  + 
drz 

cos2  6  d2 

+ 


r  dO 
2  sin  6  cos  6  d2 


ry*  * 


dO2 


r  drdO 

2  cos  6  sin  6  d  cos2  6  d 

+ - 


r 


dO 


r 


dr 


Upon  adding  these  two  equations  there  is  a  lot  of  cancelation,  and  we  obtain 

1  1 

/\u  —  Urr  T  — Ur  T  — T^Uqq  —  0, 


r 


r 


which  is  Laplace’s  equation  in  polar  coordinates. 

In  three  dimensions,  problems  often  occur  on  regions  that  have  cylindrical 
symmetry  (e.g.,  portions  of  cylinders)  or  spherical  symmetry  (e.g.,  portions  of 
spheres).  Therefore,  it  is  useful  to  have  expressions  for  the  Laplacian  in  cylindri¬ 
cal  coordinates  r,  0,  z  and  spherical  coordinates  p,  0,  0.  Cylindrical  coordinates 
in  space  are  defined  by  the  equations 


x  =  r  cos  0,  y  =  r  sin  0,  z  =  z, 


which  are  just  polar  coordinates  in  the  plane  appended  by  z.  The  Laplace’s 
equation  in  cylindrical  coordinates  is 


A  u  =  urr 


1 

H — ur  -f- 
r 


1 

~^Uqq  T  Uzz 
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Spherical  coordinates  (see  Figure  1.17)  are  related  to  rectangular  coordinates 
by  the  transformation  formulae 

x  =  p  sin  0  cos  0,  y  —  p  sin  0  sin  0,  z  =  p  cos  0. 

Laplace’s  equation  in  spherical  coordinates  is 

2  1  1 

A u  =  upp  +  -Up  +  — - — -(sin +  .  2  u9e  =  0. 

p  pz  sm  0  p 2  sin  0 

Clearly,  it  is  a  tedious  calculation  using  the  chain  rule  to  derive  this  formula, 
and  we  leave  its  verification  as  an  exercise. 


Figure  1.17  Spherical  coordinates  p,  0,  and  0 


When  symmetries  are  involved,  these  expressions  for  the  Laplacian  can 
simplify  significantly. 

Example  1.24 

Find  the  steady  temperatures  in  the  region  between  two  concentric  spheres  of 
radii  p  =  1  and  p  =  2  if  the  inner  sphere  is  held  at  zero  degrees  and  the  outer 
sphere  is  held  at  10  degrees.  Because  there  is  no  variation  of  the  temperature 
on  the  boundaries  in  the  longitudinal  (6)  direction  or  in  the  azimuthal  (0) 
direction,  it  is  clear  that  the  steady  temperature  depends  only  on  the  distance 
p  from  the  center  of  the  coordinate  system.  That  is,  u  =  u(p).  Then,  in  spherical 
coordinates,  Laplace’s  equation  simplifies  to 

2 

u"(p)  H — u\p)  =  0. 

P 

.y  V) = o. 


This  is  the  same  as 
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Integrating  twice  with  respect  to  p  gives 

/  \  a  7 
U\P)  =  -  +  o, 

P 

where  a  and  b  are  constants  of  integration.  Now,  7/(1)  =  0  forces  b  =  —a, 
and  then  7/(2)  =  10  forces  a  =  —20.  Therefore,  the  steady  temperatures  vary 
radially  according  to 

u(p)  =  20  ^1 - ^  .  □ 


EXERCISES 

1.  In  two  dimensions  suppose  u  =  u(r ,  0)  satisfies  Laplace’s  equation  A u  =  0 
in  the  disk  0  <  r  <  2,  and  on  the  boundary  it  satisfies  7/(2,  6)  =  3  sin  20  + 1, 
0  <  0  <  27 r.  What  is  the  value  of  u  at  the  origin?  Where  do  the  maximum 
and  minimum  of  u  occur  in  the  closed  domain  0  <  r  <  2? 

2.  Solve  Laplace’s  equation 

^ xx  T  ^ yy  —  1 

in  the  circle  r  <  a  with  u  =  0  on  r  =  a. 

3.  Laplace’s  equation  in  one  dimension  is  u" {pc)  =  0,  which  has  general  solu¬ 
tion  u{x)  =  ax  +  6,  where  a  and  b  are  constants.  Find  the  steady-state 
temperature  distribution  in  a  metal  bar  of  length  l  whose  ends  at  x  =  0 
and  x  =  l  are  held  at  temperatures  Tq  and  Ti,  respectively.  Sketch  a  graph 
showing  this  distribution.  Suppose  the  right  end  at  x  =  l  is  insulated;  now 
what  is  the  steady-state  temperature  distribution  if  the  left  end  is  held  at 
To  degrees? 

4.  Solve  Au  =  1  in  the  spherical  shell  a  <  p  <  b  with  u  =  0  on  p  =  a  and 
Up  =  0  on  p  =  b. 

5.  Suppose  a  metal  bar  of  unit  length  has  a  variable  conductivity  K{x)  given 
by  K{x)  =  1  +  x2,  0  <  x  <  1.  What  is  the  steady-state  temperature  dis¬ 
tribution  u{x)  if  'u(O)  =  1  and  u{  1)  =  4?  (Recall  that  steady  temperatures 
are  modeled  by  the  equation  —  [K{x)u')r  =  0.) 

6.  Find  all  radial  solutions  to  the  two-dimensional  Laplace’s  equation.  That 
is,  find  all  solutions  of  the  form  u  =  'u(r),  where  r  =  x2  +  y2 .  Find  the 
steady-state  temperature  distribution  in  the  annular  domain  1  <  r  <  2  if 
the  inner  circle  r  =  1  is  held  at  0  degrees  and  the  outer  circle  r  =  2  is  held 
at  10  degrees. 
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7.  A  spherical  shell  with  inner  radius  1  and  outer  radius  2  has  a  steady-state 
temperature  distribution.  Its  inner  boundary  is  held  at  100  degrees  and  at 
its  outer  boundary  up  =  —  b  <  0.  (a)  Find  the  temperature  distribution,  (b) 
Can  you  find  a  value  of  b  so  that  the  temperature  on  the  outer  boundary 
is  20  degrees? 

8.  Show  that  Laplace’s  equation  uxx  +  uyy  =  0  in  two  dimensions  is  invariant 
under  a  translation  of  coordinates  £  =  £  +  a,  r]  =  y  +  b;  that  is,  show 

^rjT]  —  0. 

Show  that  it  is  invariant  under  rotations 

£  =  x  cos  a  +  y  sin  a,  y  =  —  x  sin  a  +  y  cos  a. 

9.  In  a  three-dimensional  spherical  region  D  free  of  charges,  the  static  electric- 
field  vector  E  satisfies  the  two  Maxwell  equations 

divE  =  0,  curlE  =  0. 

Let  V  =  V(x,y,z )  be  the  electrical  potential  defined  by  gradV  =  E. 
(Review  multi- variable  calculus  to  see  why  such  a  potential  exists.)  Show 
that  AV  =  0.  That  is,  the  static  electric  field  potential  satisfies  Laplace’s 
equation. 

10.  This  exercise  illustrates  an  important  numerical  procedure  for  solving 
Laplace’s  equation  on  a  rectangle.  Consider  Laplace’s  equation  on  the  rect¬ 
angle  D  :  0  <  x  <  4,  0  <  y  <  3  with  boundary  conditions  given  on  the  bot¬ 
tom  and  top  by  u(x,  0)  =  0,  u(pc,  3)  =  0  for  0  <  x  <  4  and  boundary  con¬ 
ditions  given  on  the  sides  by  u( 0,  y)  =  2y{3  —  y),  u( 4,  y)  =  0  for  0  <  y  <  4. 
Apply  the  average  value  property  (1.45)  with  h  —  1  at  each  of  the  six  lattice 
points  (1, 1),  (1,  2),  (2, 1),  (2,  2),  (3, 1),  (3,  2)  inside  D  to  obtain  a  system  of 
six  equations  for  the  six  unknown  temperatures  %  —  1,2,  3;  j  =  1,  2. 

Solve  the  system  to  approximate  the  steady  temperature  distribution  and 
plot  the  resulting  approximate  surface  using  a  software  package. 


1.9  Classification  of  PDEs 

An  ordinary  differential  equation  is  classified  according  to  its  order  (the  highest 
derivative  that  appears)  and  whether  it  is  linear  or  nonlinear.  PDE  models  are 
more  difficult  to  classify.  Not  only  are  order  and  linearity  important  issues, 
but  the  PDE  structure  is  equally  important.  In  fact,  the  structure  of  a  PDE 
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dictates  what  types  of  boundary  and  initial  conditions  can  be  imposed  to  yield 
a  well-posed  problem,  and  ultimately  what  types  of  physical  processes  the  PDE 
covers. 

To  fix  the  notion,  we  consider  second-order  equations.  We  have  already 
obtained  three  model  equations: 


u+  —  kuxx  =  0 

'U'tt  ^  'LL xx  —  0 

'LL XX  U yy  —  0 


(diffusion  equation), 
(wave  equation), 
(Laplace’s  equation). 


The  first  two  are  evolution  equations ,  which  describe  how  a  process  evolves  in 
time;  we  expect  that  each  would  require  initial  data  that  give  the  state  of  the 
system  at  time  t  =  0.  The  third  is  an  equilibrium  equation  where  time  is  not 
an  issue — we  expect  that  only  boundary  conditions  are  appropriate.  Moreover, 
we  infer  that  solutions  of  these  three  equations  behave  differently.  Diffusion 
tends  to  smear  out  signals,  whereas  waves  tend  to  propagate  signals  with  some 
coherence  (e.g.,  a  traveling  wave);  solutions  to  Laplace’s  equation  are  steady- 
state  and  do  not  vary  in  time  at  all.  Therefore,  if  we  were  to  classify  these  three 
equations  physically,  we  would  call  them  diffusive,  wave-like,  and  static, 
respectively.  But  there  is  a  consistent  characterization  based  on  the  form  of  the 
equation  alone. 

In  general,  let  us  consider  a  second-order  differential  equation  of  the  form 


Auxx  T  Buxt  T  Cutt  T  B(x^  t,  tq  uXl  tq)  —  0,  (1.58) 

where  A ,  B ,  and  C  are  constants.  Notice  that  the  second-order  derivatives  are 
assumed  to  appear  linearly,  or  to  the  first  degree;  the  expression 


Lu  =  Auxx  ~\~  Buxt  +  Cutt 


is  called  the  principal  part  of  the  equation,  and  it  is  on  this  part  that  we 
base  the  classification.  Observe  that  we  used  independent  variables  x  and  t\ 
should  the  problem  involve  only  spatial  variables,  we  would  replace  t  by  y.  The 
classification  is  based  on  the  sign  of  the  quantity 

D  =  B2  -  4AC, 

which  is  called  the  discriminant. 

Definition.  If  D  >  0,  we  say  that  (1.58)  is  hyperbolic;  if  D  <  0,  we  say  that 
(1.58)  is  elliptic;  if  D  =  0,  we  say  that  (1.58)  is  parabolic.  □ 

Under  this  classification,  it  is  easy  to  check  that  the  diffusion  equation  is 
parabolic ,  the  wave  equation  is  hyperbolic ,  and  Laplace’s  equation  is  elliptic. 
This  terminology  comes  from  the  familiar  classification  of  plane  curves;  for 
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example,  Ax2  +  Ct 2  =  1,  where  A,  C  >  0,  B  =  0  (and  so  D  <  0)  plots  as  an 
ellipse  in  the  xt- plane,  and  Ax2  —  Ct2  =  1  plots  as  a  hyperbola,  and  so  forth.  As 
it  turns  out,  all  parabolic  equations  are  diffusion-like,  all  hyperbolic  equations 
are  wave-like,  and  all  elliptic  equations  are  static.  Later  we  show  why  this  is 
true. 

If  A,  B ,  and  C  are  not  constants,  but  rather  functions  of  the  independent 
variables  x  and  £,  then  the  discriminant  D  =  D(pc,t)  depends  on  x  and  t.  We 
make  the  same  definitions  in  this  case,  but  now  the  sign  of  D  can  change, 
depending  upon  the  domain.  Problems  that  change  type  from  one  region  to 
another  can  be  difficult,  and  we  do  not  consider  them  in  this  text.  Classification 
of  systems  of  PDEs  is  also  possible,  as  well  as  classification  of  PDEs  with 
nonlinear  principal  parts  where  the  coefficients  A,  B ,  and  C  depend  on  the 
solution  u  itself.  The  reader  is  referred  to  a  more  advanced  treatment  for  a 
discussion. 

Now  we  show  that  the  principal  part  Lu  in  equation  (1.58)  can  be  simplified 
for  each  of  the  three  types  by  introducing  a  new  set  of  independent  variables. 
This  strategy  of  searching  for  a  change  of  variables  that  simplifies  a  problem  is 
common  in  differential  equations.  We  examine  the  case  where  A ,  E>,  and  C  are 
constant,  and  we  seek  a  linear  transformation 

£  =  ax  +  bt,  r  =  cx  +  dt 

that  simplifies  the  PDE.  Here  £  and  r  are  new  independent  variables,  and 
a,  6,  c,  and  d  are  to  be  determined.  We  assume  that  ad  —  be  ^  0,  so  that 
the  transformation  is  invertible,  that  is,  we  may  solve  for  x  and  t  in  terms  of 
£  and  r.  The  dependent  function  u  in  the  new  variables  will  be  denoted  by 
U  =  t/(£,  r);  that  is,  u(x,  t)  =  U (ax  +  cx  +  dt).  Then,  by  the  chain  rule  for 
derivatives, 


^ x  —  T  Uttx  —  aU ^  T  c/7r, 

Ut  =  U^t  T  UrTf  =  bU{:  +  dUT. 


To  get  the  second  partial  requires  another  application  of  the  chain  rule.  We 
have 


d 


u 


_  _(a[/5  +  cC/t) 

—  a(aU^  T  cU^r)  T  c(aU^T  T  cUTT) 
=  n2TJcc  -4-  (2n.rTIc~  4-  r2TJ — 


The  other  two  second  partial  derivatives  are  computed  similarly,  and  we  get 

Un  —  b2U^  T  2bdU^r  T  d2UTT , 
uxt  =  abU ££  (ad  -j-  cb)U^T  T  cdUTT. 
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Substituting  these  quantities  into  the  principal  part  and  collecting  terms,  we 
get 

Auxx  +  Buxt  +  Cutt  =  (Aa2  +  Bab  +  Cb2)U ^  (1.59) 

+  (2acA  +  B(ad  +  be)  +  2Cbd)U^T  (1.60) 

+  (Ac2  +  Bed  +  Cd2)UTT.  (1.61) 


Now  we  can  select  a,  6,  c,  d  so  that  some  of  the  second  partial  derivatives  in  the 
new  variables  disappear.  This  process  must  be  handled  differently  depending 
on  the  sign  of  the  discriminant  D. 


Hyperbolic  case,  D  >  0.  We  have  some  flexibility,  so  let  us  choose  a  =  c  = 
1.  Then  the  coefficients  of  U ^  and  t/TT,  which  have  the  same  form,  become 
quadratic  expressions  in  b  and  d,  respectively,  and  we  can  force  those  coefficients 
to  vanish  by  choosing 


-b  +  Vd 

2 C 


-b-Vd 

2 C 


(1.62) 


Here  we  have  just  used  the  quadratic  formula.  The  remaining  coefficient,  that  of 
U^T,  is  nonzero  (we  leave  this  as  an  exercise).  In  summary,  the  transformation 


£  =  x  + 


-b  +  Vd 
2 C 


t,  T  =  X  + 


—B  -  \[D 

2 C 


t 


(1.63) 


transforms  the  PDE  (1.58)  into  a  simpler  equation  of  the  form 


U^r  +  G(£,  r,  U,  Ug,  UT)  =  0, 

where  only  the  mixed  second  partial  derivative  appears.  Thus  there  is  a  signif¬ 
icant  simplification  over  (1.63),  where  all  the  second  partial  derivatives  occur. 

This  latter  equation  is  called  the  canonical  form  of  a  hyperbolic  equation, 
and  the  coordinates  £  and  r  defined  by  (1.63)  are  called  the  characteristic 
coordinates.  Transformation  to  these  coordinates  is  almost  always  a  preferred 
strategy  for  hyperbolic  PDEs. 

Finally,  if  C  =  0,  then  (1.62)  is  not  valid;  in  this  case  select  b  =  d  =  1  and 

—B  +  \[~D  —B  —  \f~D 

U  =  - : - ,  C  =  - ; - . 

2  A  2  A 


Parabolic  case,  D  =  0.  Now  equations  (1.62)  give  b  =  d,  and  the  resulting 
transformation  £  =  x  +  r  =  x  +  bt  is  not  invertible.  So  we  must  proceed 
differently.  Observe  that  if  we  choose  a  =  c  =  1,  d  =  —B/(2C),  and  6  =  0, 
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then  the  coefficients  of  UTT  and  U^T  in  (1.61)  both  vanish  (another  exercise!). 
Hence,  the  transformation 

,  B 

t,=X,  T  =  X  —  —t 

transforms  equation  (1.58)  into 

Uii  +  H(t,T,U,Ui,UT)  =  0, 

where  only  a  double  partial  appears,  and  this  is  the  canonical  form  in  the 
parabolic  case.  Observe  its  similarity  to  the  heat  equation,  where  there  is  only 
a  single  second  derivative. 

Elliptic  case,  D  <  0.  Now  b  and  d  in  (1.62)  are  conjugate  complex  numbers, 
i.e.,  d  =  b.  Selecting  a  =  c  =  1,  we  obtain  a  complex  transformation 

£  =  x  +  bt,  t  =  x  +  bt. 

But  then,  a  real  transformation  can  be  found  by  taking  real  variables 

It  is  an  exercise  to  show  that  this  transforms  (1.58)  into  the  equation 

Uaa  +  Uf3f3  +  K(a,  /?,  U,Ua,U(3)  =0, 

where  both  double  partial  derivatives  are  present  and  the  mixed  partial  is 
absent;  the  last  equation  is  the  canonical  form  for  elliptic  equations.  We  rec¬ 
ognize  the  combination  of  second  partial  derivatives  as  the  Laplacian  operator. 

Generally,  characteristic  coordinates  are  useful  only  for  hyperbolic  equa¬ 
tions,  and  they  do  not  play  a  particularly  important  role  in  elliptic  and 
parabolic  equations.  First  order  equations,  like  a  reaction-advection  equation, 
are  classified  as  hyperbolic  because  they  propagate  signals  like  the  wave  equa¬ 
tion. 


EXERCISES 

1.  Classify  the  PDE 

^ xx  +  ^kiixt  T  k  Uff  —  0,  k  0. 

Find  a  transformation  £  =  x  +  bt,  r  =  x  +  dt  of  the  independent  variables 
that  transforms  the  equation  into  a  simpler  equation  of  the  form  =  0. 
Find  the  solution  to  the  given  equation  in  terms  of  two  arbitrary  functions. 
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2.  Find  the  solution  of  the  PDE 


2u xx  —  d'Z/'W  -f~  ux  —  0 


in  terms  of  two  arbitrary  functions. 

3.  Classify  the  PDE 

xuxx  —  4uxt  =  0 

in  the  region  x  >  0.  Observe  that  the  PDE  has  variable  coefficients.  Solve 
the  equation  by  making  the  nonlinear  change  of  variables  £  =  t,  r  = 
t  +  4  In  ax  The  solution  is 

u  =  e~1^  f(t  +  4  lnx)  +  g(t ), 

where  /  and  g  are  arbitrary  functions. 

4.  Show  that  the  equation 

Utt  -  C2uxx  +  aut  +  bux  +  du  =  /( x,  t) 
can  be  transformed  into  an  equation  of  the  form 


wiT  +  kw  =  g(€,r),  w  =  w(£,t) 


by  first  making  the  transformation  £  =  x  —  ct,  r  =  x  +  ct  and  then  letting 
u  =  wea^^T  for  some  choice  of  a ,  (3. 


5.  Classify  the  PDE 


u 


ry  ry 
•As  IAS 


bit o? ?y  T  1210,7 1  —  0. 


jxy  i 

Find  a  transformation  of  independent  variables  that  changes  it  to  Laplace’s 
equation. 


6.  Classify  the  PDE 


x  nxx 


T  2xuxt  T  Uff  —  Uf 


and  find  the  general  solution  using  the  nonlinear  transformation  £  =  xe 
T  =  t. 


7.  Classify  the  following  PDEs  with  variable  coefficients  and  sketch  regions 
where  they  are  of  different  type. 

&)  y^XX  2jVjXy  T  XUyy  -  0. 

b)  (1  T  x^uxx  2xyuxy  y  nyy  —  0. 

8.  Show  that  the  diffusion  equation  is  diffusive  and  the  wave  equation  is  nei¬ 
ther  diffusive  or  dispersive.  (See  Exercises  12,  13,  Section  1.1). 


Partial  Differential  Equations 
on  Unbounded  Domains 


In  our  study  of  PDEs  we  noted  the  differences  among  the  types  of  equations: 
parabolic,  hyperbolic,  and  elliptic.  Those  classifications  dictate  the  types  of 
initial  and  boundary  conditions  that  should  be  imposed  to  obtain  a  well-posed 
problem.  There  is  yet  another  division  that  makes  one  method  of  solution  prefer¬ 
able  over  another,  namely,  the  nature  and  extent  of  the  spatial  domain.  Spatial 
domains  may  be  bounded,  like  a  bounded  interval,  or  unbounded,  like  the  entire 
set  of  real  numbers.  It  is  a  matter  of  preference  which  type  of  domain  is  studied 
first.  It  seems  that  boundaries  in  a  problem,  which  require  boundary  conditions, 
make  a  problem  more  difficult.  Therefore,  we  first  investigate  problems  defined 
on  unbounded  domains. 


2.1  Cauchy  Problem  for  the  Heat  Equation 

We  begin  with  the  heat,  or  diffusion,  equation  on  the  real  line.  That  is,  we 
consider  the  initial  value  problem 

Ut  —  kuXxi  %  £  t  0,  (2.1) 

u{pc,  0)  =  </>(#),  xGl.  (2-2) 

Physically,  this  problem  is  a  model  of  heat  flow  in  an  infinitely  long  bar  where 
the  initial  temperature  4>(x)  is  prescribed.  In  a  chemical  or  biological  con- 
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text,  the  equation  governs  density  variations  under  a  diffusion  process.  Notice 
that  there  are  no  boundaries  in  the  problem,  so  we  do  not  prescribe  boundary 
conditions  explicitly.  However,  for  problems  on  infinite  domains,  conditions  at 
infinity  are  sometimes  either  stated  explicitly  or  understood.  Such  a  condition 
might  require  boundedness  of  the  solution  or  some  type  of  decay  condition  on 
the  solution  to  zero  as  x  —>  Too.  In  mathematics,  a  pure  initial  value  problem 
like  (2.1)-(2.2)  is  often  called  a  Cauchy  problem. 

Deriving  the  solution  of  (2.1)-(2.2)  can  be  accomplished  in  two  steps.  First 
we  solve  the  problem  for  a  step  function  4>(x),  and  then  we  construct  the  solu¬ 
tion  to  (2.1)-(2.2)  using  that  solution.  Therefore,  let  us  consider  the  problem 

wt  =  kwxx ,  x  e  M,  t  >  0,  (2.3) 

w(x,  0)  =  0  for  x  <  0;  w(x,  0)  =  uo  for  x  >  0,  (2.4) 


where  we  have  taken  the  initial  condition  to  be  a  step  function  with  jump  uq. 

We  motivate  our  approach  with  a  simple  idea  from  the  subject  of  dimen¬ 
sional  analysis.  Dimensional  analysis  deals  with  the  study  of  units  (seconds, 
meters,  kilograms,  and  so  forth)  and  dimensions  (time,  length,  mass,  and  so 
forth)  of  the  quantities  in  a  problem  and  how  they  relate  to  each  other.  Equa¬ 
tions  must  be  dimensionally  consistent  (one  cannot  add  apples  to  oranges),  and 
important  conclusions  can  be  drawn  from  this  fact.  The  cornerstone  result  in 
dimensional  analysis  is  called  the  pi  theorem .  The  pi  theorem  guarantees  that 
whenever  there  is  a  physical  law  relating  dimensioned  quantities  <7i, . . . ,  <7™, 
then  there  is  an  equivalent  physical  law  relating  the  independent  dimensionless 
quantities  that  can  be  formed  from  #i, . . . ,  </m.  By  a  dimensionless  quantity  we 
mean  one  in  which  all  the  dimensions  (length,  time,  mass,  etc.)  cancel  out.  For 
a  simple  example  take  the  law 


1  2 

--gt  Tut, 


which  gives  the  height  h  of  an  object  at  time  t  when  it  is  thrown  upward 
with  initial  velocity  v;  the  constant  g  is  the  acceleration  due  to  gravity.  Here 
the  dimensioned  quantities  are  /i,  t,  v,  and  g,  having  dimensions  length,  time, 
length  per  time,  and  length  per  time-squared.  This  law  can  be  rearranged  and 
written  equivalently  as 


h__  1 

vt  2 


+  1 


in  terms  of  the  two  dimensionless  quantities 


h  gt 

7Ti  =  —  and  7T2  =  — . 
vt  v 

For  example,  h  is  a  length  and  vt,  a  velocity  times  a  time,  is  also  a  length;  so  7Ti, 
or  h  divided  by  vt ,  has  no  dimensions.  Similarly,  7T2  =  gt/v  is  dimensionless. 
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A  law  in  dimensioned  variables  can  always  be  reformulated  in  dimensionless 
quantities.  So  the  physical  law  can  be  written  as  7Ti  =  +  1. 

We  use  similar  reasoning  to  guess  the  form  of  the  solution  of  the  initial  value 
problem  (2.3)-(2.4).  First  we  list  all  the  variables  and  constants  in  the  problem: 
x,  £,  re,  tzo,  k.  These  have  dimensions  length,  time,  degrees,  degrees,  and  length- 
squared  per  time,  respectively.  We  notice  that  w/uq  is  a  dimensionless  quantity 
(degrees  divided  by  degrees);  the  only  other  independent  dimensionless  quantity 
in  the  problem  is  x  /V Akt  (the  “4”  is  included  for  convenience).  By  the  pi 
theorem  we  expect  that  the  solution  can  be  written  as  some  combination  of 
these  dimensionless  variables,  or 


w  _  n  (  x  \ 

UQ  V  \/4  kt ) 

for  some  function  /  yet  to  be  determined.  In  fact,  this  is  the  case.  So  let  us 
substitute 

w  =  m'  2  =  7E 

into  the  PDE  (2.3).  We  have  taken  uq  =  1  for  simplicity.  The  chain  rule  allows 
us  to  compute  the  partial  derivatives  as 


Wt  =  f\z)zt  =  -- 


1  X 


w 


X 


d 


2  V4 kfi 

1  m 


m 


y/Akt 


1 


Wxx  =  —  ILlr  = 


dx 


7 X 


Akt 


f"(z) 


Substituting  into  (2.3)  gives,  after  some  cancelation,  an  ordinary  differential 
equation, 

f"(z)  +  2zf(z)  =  0, 

for  f(z).  This  equation  is  easily  solved  by  multiplying  through  by  the  integrat- 

2 

ing  factor  ez  and  integrating  to  get 

f{z)  =  Cie-*2, 


where  c\  is  a  constant  of  integration.  Integrating  from  0  to  z  gives 

f(z)  =  Cl  f  e~r2dr  +  c2, 

Jo 

where  c 2  is  another  constant  of  integration.  Therefore  we  have  determined 
solutions  of  (2.3)  of  the  form 


‘X 


/  \/4  kt 


e  r  dr  +  C2 


w(x,  t )  =  Cl 


0 
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Next  we  apply  the  initial  condition  (2.4)  (taking  uq  =  1)  to  determine  the 
constants  c\  and  For  a  fixed  x  <  0  we  take  the  limit  as  t  — >  0  to  get 


■oo 


0  =  w(x,  0)  =  c\  /  e  r  dr  +  C2 

J  o 

For  a  fixed  x  >  0  we  take  the  limit  as  t  0  to  get 

oo 


no o 

1  =  w(x,  0)  =  ci  /  e~r  dr  +  c 2 

Jo 


Recalling  that 


•00 


2  \  7T 

:_r  dr  = 


0 


2  ’ 


we  can  solve  the  last  two  equations  to  get  ci  =  1/v^1?  c2  =  1/2.  Therefore,  the 
solution  to  (2.3)-(2.4)  with  uq  =  1  is 


This  solution  can  be  written  nicely  as 

w(x,  t)  =  —  ( 1  +  erf  f  ^ 
in  terms  of  a  special  function  called  the  “erf”  function,  which  is  defined  by 


Figure  2.1  shows  a  graph  of  several  time  snapshots  of  the  solution  (2.6). 

Now  we  will  use  (2.5)  and  a  physical  argument  to  deduce  a  solution  to  the 
Cauchy  problem  (2.1)-(2.2).  Later,  in  Section  2.7,  we  present  an  analytical 
argument  based  on  Fourier  transforms.  We  make  some  observations.  First,  if  a 
function  w  satisfies  the  heat  equation,  then  so  does  wXl  the  partial  derivative 
of  that  function  with  respect  to  x.  This  is  easy  to  see  because 


0  —  (ay  kwxx^x  —  i^x)t  k(wx^)xx' 

Therefore,  since  w(x,t)  solves  the  heat  equation,  the  function 

G(x,t)  =  wx(x,t) 

solves  the  heat  equation.  By  direct  differentiation  of  w(x,t)  we  find  that 

G(x,t)  =  _4=e-*2/(4fct).  (2.7) 

V4 M 
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x 


Figure  2.1  Temperature  profiles  u—  \  ^1  +  erf  {xjyj^kt  ^  at  three  different 

times  t  when  the  initial  temperature  is  a  step  function  and  k  =  1.  As  time 
increases,  the  profiles  are  smearing  out 


The  function  G  is  called  the  heat  kernel  or  fundamental  solution  to  the 

heat  equation;  the  reader  will  note  that  for  each  t  >  0  it  graphs  as  a  bell-shaped 
curve  (see  Exercise  1,  Section  1.1),  and  the  area  under  the  curve  for  each  t  >  0 
is  one;  that  is, 


G(x,  t)dx  =  1, 


t  >  0. 


G(x,  t )  is  the  temperature  surface  that  results  from  an  initial  unit  heat  source, 
i.e. ,  injecting  a  unit  amount  of  heat  at  x  =  0  at  time  t  —  0.  We  further 
observe  that  shifting  the  temperature  profile  again  leads  to  a  solution  to  the 
heat  equation.  Thus,  G(x  —  y,  £),  which  is  the  temperature  surface  caused  by 
an  initial  unit  heat  source  at  y ,  solves  the  heat  equation  for  any  fixed,  but 
arbitrary,  y.  If  </>(?/),  rather  than  unity,  is  the  magnitude  of  the  source  at  ?/, 
then  cj){y)G(x  —  y,t)  gives  the  resulting  temperature  surface;  the  area  under  a 
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lx 

>  0.5\ 


Figure  2.2  Plot  of  the  fundamental  solution  u  =  G(x,t)  to  the  diffusion 
equation  (2.7).  As  t  0+  the  solution  approaches  a  unit  ‘point  source’  at 


temperature  profile  is  now  </>(?/),  where  y  is  the  location  of  the  source.  Now,  let  us 
regard  the  initial  temperature  function  <fi  in  (2.2)  as  a  continuous  distribution  of 
sources  cj)(y)  for  each  y  E  R.  Then,  superimposing  all  the  effects  4>(y)G(x  —  y,t) 
for  all  y  gives  the  total  effect  of  all  these  isolated  sources;  that  is, 


/oo 

4>{y)G(x  -  y,  t)dy 

-OO 


■OO 

•OO 

■OO 


<t>(y) 


1 


\XiFki 


e-(x-v)2/(A  kt)dy 


is  a  solution  to  the  Cauchy  problem  (2.1)-(2.2)  for  reasonable  assumptions  on 
the  initial  condition  </>.  More  precisely: 


Theorem  2.1 

Consider  the  initial  value  problem  for  the  heat  equation, 

ut  =  kuxx ,  x  e  R,  t  >  0, 
u(x,  0)  =  </>(#),  xGl, 

where  0  is  a  bounded  continuous  function  on  R.  Then 

My)  ,  ^  e 

\f4jrki 


-( x-y)2/{Akt)dy  (2.8) 
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is  a  solution  to  the  heat  equation  for  x  E  R,  t  >  0,  and  it  has  the  property  that 
u(x,t)  —>  cj)(x)  as  t  0+.  If  0  is  piecewise  continuous,  i.e.,  it  has  only  finitely 
many  jump  discontinuities  in  any  bounded  interval,  then  u(x,  t)  is  a  solution 
to  the  heat  equation  on  x  £  R,  t  >  0;  and,  as  t  0+,  the  solution  approaches 
the  average  value  of  the  left  and  right  limits  at  a  point  of  discontinuity  of  </>;  in 
symbols, 

1 

u(x,  t)  —  (<; i>{x~ )  +  </>(x+))  as  £  — 0+.  □ 


This  discussion  of  the  Cauchy  problem  for  the  heat  equation  has  been  intu¬ 
itive,  and  it  provides  a  good  basis  for  understanding  why  the  solution  has  the 
form  it  does. 

There  is  another,  standard  way  to  write  the  solution  (2.8)  to  the  Cauchy 
problem  (2.1)-(2.2).  If  we  change  variables  in  the  integral  using  the  substitution 
r  =  (x  —  y)/yj4kt,  then  dr  =  —dy/V^kt,  and  (2.8)  becomes 


u(x,  t )  = 


This  formula  is  called  the  Poisson  integral  representation. 


We  make  several  observations.  First,  the  solution  of  the  Cauchy  problem  is 
an  integral  representation.  Although  the  formula  is  not  complicated,  for  most 
initial  conditions  (j)(x)  the  integration  cannot  be  performed  analytically.  There¬ 
fore,  numerical  or  computer  evaluation  of  the  integral  is  ultimately  required  if 
temperature  profiles  are  desired.  Also,  notice  that  the  temperature  u(x,  t)  is 
nonzero  for  every  real  x,  even  if  </>  is  zero  outside  a  small  interval  about  the 
origin.  Thus,  a  signal  propagated  by  the  heat,  or  diffusion,  equation  travels 
infinitely  fast;  according  to  this  model,  if  odors  diffuse,  a  bear  would  instantly 
smell  a  newly  opened  can  of  tuna  ten  miles  away.  Next,  although  we  do  not  give 
a  proof,  the  solution  given  by  (2.8)  is  very  smooth;  that  is,  u  is  infinitely  differ¬ 
entiable  in  both  x  and  t  in  the  domain  t  >  0;  this  is  true  even  if  </>  is  piecewise 
continuous.  Initial  signals  propagated  by  the  heat  equation  are  immediately 
smoothed  out. 

Finally,  we  note  that  the  heat  kernel  G(x,  t)  defined  in  (2.7)  is  also  called  the 
Green’s  function  for  the  Cauchy  problem.  In  general,  the  Green’s  function  for 
a  problem  is  the  response  of  a  system,  or  the  effect,  caused  by  a  point  source.  In 
heat  flow  on  the  real  line,  G(x,t)  is  the  response,  i.e.,  the  temperature  surface 
caused  by  a  unit,  point  heat  source  given  to  the  system  at  x  =  0,  t  =  0.  Some 
of  the  references  discuss  the  construction  of  a  Green’s  functions  for  a  variety  of 
problems.  Because  of  the  basic  role  this  function  plays  in  diffusion  problems, 
G(x,  t)  is  also  called  the  fundamental  solution  to  the  heat  equation.  The 
reader  should  review  Section  1.4  for  a  discussion  of  the  origin  of  the  fundamental 
solution  from  a  probability  discussion. 
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EXERCISES 


1.  Solve  the  Cauchy  problem  (2.1)-(2.2)  for  the  following  initial  conditions. 


a)  (j)(x)  =  1  if  \x\  <  1  and  (j){x)  =  0  if  \x\  >  1. 

b)  <p(x)  =  e~x ,  x  >  0;  (j>{x)  =0,  x  <  0. 

In  both  cases  write  the  solutions  in  terms  of  the  erf  function.  Hint:  In  (b) 
complete  the  square  with  respect  to  y  in  the  exponent  of  e. 


2.  If  |0(x)|  <  M  for  all  x,  where  M  is  a  positive  constant,  show  that  the 
solution  u  to  the  Cauchy  problem  (2.1)-(2.2)  satisfies  \u(x,t)\  <  M  for  all 
x  and  t  >  0.  Hint:  Use  the  calculus  fact  that  the  absolute  value  of  an  integral 
is  less  than  or  equal  to  the  integral  of  the  absolute  value:  |  J  f\  <  /i/i- 


3.  Consider  the  problem  (2.3)-(2.4)  with  uo  =  1.  For  a  fixed  x  =  xq,  what  is 
the  approximate  temperature  w(xo,t)  for  very  large  tl  Hint:  Expand  the 
integrand  in  the  formula  for  the  solution  in  a  power  series  and  integrate 
term  by  term. 


4.  Show  that  if  u(x,t)  and  v(x,t)  are  any  two  solutions  to  the  heat  equation 
(2.1),  then  w(x,y,t)  =  u(x,t)v(y,t)  solves  the  two-dimensional  heat  equa¬ 
tion  wt  =  k(wxx  H -Wyy).  Can  you  guess  the  solution  to  the  two-dimensional 
Cauchy  problem 


Wt  =  k(wxx  +  Wyy),  (. x ,  y)  e  R2,  t  >  0, 
w{x,  y,  0)  =  ip{x,  y),  (x,  y)  €  R2? 


5.  Let  the  initial  temperature  in  the  Cauchy  problem  (2.1)-(2.2)  be  given  by 
4>{x)  =  e_l:r+2l  with  k  =  1.  Use  the  numerical  integration  opera¬ 

tion  in  a  computer  algebra  package  to  draw  temperature  profiles  at  several 
times  to  illustrate  how  heat  flows  in  this  system.  Exhibit  the  temperature 
profiles  on  a  single  set  of  coordinate  axes. 


6. 


Verify  that 


f‘OQ 

/  G(x,t)dx 


=  1, 


t  >  0. 


J  —  oo 

Hint:  Change  variables  as  in  the  derivation  of  Poisson’s  integral  represen¬ 
tation. 


7.  Consider  the  Cauchy  problem  for  the  heat  equation 


Ut  =  kuxx ,  xgM,  t  >  0;  u(x,  0)  =  e  x ,  x  E  R. 

Verify  that  u(x,  t)  =  e~xJrkt  is  an  unbounded  solution.  Is  this  a  contradic¬ 
tion  to  the  theorem? 


2.2  Cauchy  Problem  for  the  Wave  Equation 
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2.2  Cauchy  Problem  for  the  Wave  Equation 


The  one-dimensional  wave  equation  is 


^tt  C  ^xx  —  0< 


(2.10) 


We  observed  in  Section  1.5  that  it  models  the  amplitude  of  transverse  dis¬ 
placement  waves  on  a  taut  string  as  well  as  small  amplitude  disturbances  in 
acoustics.  It  also  arises  in  electromagnetic  wave  propagation,  in  the  mechani¬ 
cal  vibrations  of  elastic  media,  as  well  as  in  other  problems.  It  is  a  hyperbolic 
equation  and  is  one  of  the  three  fundamental  equations  in  PDEs  (along  with 
the  diffusion  equation  and  Laplace’s  equation).  Under  the  transformation  of 
variables  (to  characteristic  coordinates) 


£ 


X 


Ct ,  T  =  X  +  ct, 


the  wave  equation  is  transformed  into  the  canonical  form 

Urt  =  0,  U  =  U(£,t), 

which  can  be  integrated  twice  to  obtain  the  general  solution 

U(Z,t)=F(0+G(t), 

where  F  and  G  are  arbitrary  functions.  Thus,  the  general  solution  to  (2.10)  is 


u(x,  t)  =  F(x  —  ct)  +  G(x  +  ct) 


(2.11) 


Hence,  solutions  of  the  wave  equation  are  the  superposition  (sum)  of  right-  and 
left-traveling  waves  moving  at  speed  c. 

The  Cauchy  problem  for  the  wave  equation  is 

Utt-c2uxx  =  0,  x  e  ,  t  >  0,  (2-12) 

u(x,Q)  =  f(x),  Ut(x,  0)  =  g(x),  xGl.  (2.13) 

Here,  /  defines  the  initial  displacement  of  an  infinite  string,  and  g  defines  its 
initial  velocity.  The  equation  is  second-order  in  t,  so  both  the  position  and 
velocity  must  be  specified  initially. 

There  is  a  simple  analytical  formula  for  the  solution  to  the  Cauchy  problem 

(2.12) -(2.13).  It  is  called  d’Alembert’s  formula,  and  it  is  given  by 

1  1  rx+ct 

u(x,t)  =  -(f(x-ct)+f(x  +  ct))  +  —  g(s)ds.  (2.14) 

2  2c  Jx_ct 

If  f"  and  g'  are  continuous,  then  it  is  a  straightforward  exercise  in  differential 
calculus,  using  Leibniz’s  formula,  to  verify  that  this  formula  solves  (2.12)- 

(2.13) .  The  formula  can  be  derived  (see  Exercise  1)  by  determining  the  two 
functions  F  and  G  in  (2.11)  using  the  initial  data  (2.13). 
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Example  2.2 

Insight  into  the  behavior  of  solutions  comes  from  examining  the  special  case 
where  the  initial  velocity  is  zero  and  the  initial  displacement  is  a  bell-shaped 
curve.  Specifically,  we  consider  the  problem  (with  c  =  2) 


utt  ~  4 uxx  =  0,  x  G  ,  t  >  0, 


u(x,  0)  =  e 


Ut(x,  0)  =  0,  xGl. 


The  exact  solution  is,  by  d’Alembert’s  formula, 


u(x,  t ) 


( x  2 1)  _|_  g  (x-\-2t)  j 


Either  the  solution  surface  or  wave  profiles  can  be  graphed  easily  using  a  com¬ 
puter  algebra  package.  Figure  2.3  shows  the  solution  surface;  observe  how  the 
initial  signal  at  t  =  0  splits  into  two  smaller  signals,  and  those  travel  off  in  oppo¬ 
site  directions  at  speed  c  =  2.  In  the  exercises  the  reader  is  asked  to  examine 
the  case  where  /  =  0  and  g  ^  0;  this  is  the  case  where  the  initial  displacement 
is  zero  and  the  string  is  given  an  initial  velocity,  or  impulse,  by,  say,  striking 
the  string  with  an  object.  □ 


Figure  2.3  Time  profiles  of  the  solution  surface.  The  initial  signal  splits  into 
two  signals  which  move  at  speeds  c  and  —  c  along  the  positive  and  negative 
characteristics,  x  —  ct  =  const.,  x  +  ct  =  const.,  respectively 


Close  examination  of  d’Alembert’s  formula  reveals  a  fundamental  property 
of  the  wave  equation.  If  the  initial  disturbance  is  supported  in  some  interval 
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a  <  x  <  b  (this  means  that  it  is  zero  outside  that  interval,  so  the  signal  is 
located  only  in  a  <  x  <  b),  then  the  signal  is  always  zero  outside  the  region 
bounded  by  the  two  straight  lines  x  +  ct  =  a  and  x  —  ct  =  b.  See  Figure  2.4. 


Figure  2.4  Region  of  influence  and  domain  of  dependence 


This  region  is  called  the  region  of  influence  of  the  interval  [a,  b\.  An 
initial  signal  in  [a,  b]  can  never  affect  the  solution  outside  this  region.  The  lines 
x  +  ct  =  constant  are  paths  in  space-time  along  which  signals  are  propagated 
at  velocity  —c;  the  lines  x  —  ct  =  constant  are  paths  in  space-time  along  which 
signals  are  propagated  with  velocity  c.  These  two  families  of  lines  are  called  the 
negative  and  positive  characteristics,  respectively.  If  the  interval  [a,  b]  is 
shrunk  to  a  point,  then  the  region  of  influence  shrinks  to  a  cone,  which  is  called 
the  light  cone.  Looking  at  the  situation  in  reverse,  we  can  ask  what  initial  data 
can  affect  the  solution  at  a  point  (xo,to).  From  the  d’Alembert  formula,  only 
the  initial  values  in  the  interval  [xo  —  cto,xo  +  cto\  will  affect  the  solution  at 
(xo,  to).  This  interval  is  called  the  domain  of  dependence,  and  it  is  found  by 
tracing  the  characteristics  emanating  from  the  point  (xq^q)  backward  in  time 
to  the  x-axis. 

In  summary,  there  are  important  points  to  note  regarding  the  characteristic 
curves.  First,  they  are  curves  that  carry  the  signals  forward  in  space-time  with 
velocity  c  and  —  c.  Second,  they  define  a  special  coordinate  system  £  =  x  —  ct, 
r  =  x  +  ct  under  which  the  wave  equation  uu  —  c2uxx  =  0  is  reduced  to  the 
simple  canonical  form  u^T  =  0.  In  hyperbolic  problems  there  is  always  a  set 
of  characteristic  curves  that  play  these  roles.  Even  first  order  PDEs,  which 
are  actually  wave-like,  have  one  family  of  such  curves  that  carry  signals  and 
provide  a  distinguished  coordinate  system  where  the  problem  simplifies  (recall 
the  examples  in  Section  1.2). 

Finally,  we  point  out  again  the  important  differences  between  parabolic  and 
hyperbolic  problems.  Hyperbolic,  or  wave-like,  equations  propagate  signals  at 
a  finite  speed  along  characteristics;  there  is  coherency  in  the  wave  form  as 
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it  propagates,  and  therefore  information  is  retained.  Parabolic,  or  diffusion, 
equations  propagate  signals  at  infinite  speed;  because  the  signals  diffuse  or 
smear  out,  there  is  a  gradual  loss  of  information.  A  good  way  to  understand 
how  different  equations  propagate  information  is  to  determine  how  a  signal 
is  propagated  in  a  special  case.  For  example,  suppose  the  initial  signal  is  a 
Gaussian  function  or  bell-shaped  curve  exp(— x2).  Think  of  this  signal  as  being 
a  bit  of  information.  The  convection  equation  ut  +  cux  =  0,  which  is  a  wave-like 
equation,  propagates  this  signal  via 

u(x,  t )  =  e-G-cb  . 


That  is,  it  moves  it  at  speed  c  without  distortion.  The  wave  equation  utt  = 
c2uxx  moves  it  via 

u(x,t)  =  0.5(e~(x~ct)2  +  e~(x+ct)2). 


So  the  signal  breaks  into  two  pieces,  and  they  propagate  in  opposite  directions 
at  speed  c.  The  diffusion  equation  ut  =  kuxx  propagates  the  signal  via 

u{x,t)  =  1  p-xa/(l+4  kt) 

v  ’  VI  +  4  kt 

So  the  signal  stays  at  the  same  place,  but  it  spreads  out  and  decreases  in 
amplitude.  Any  information  in  the  signal  is  eventually  lost. 


EXERCISES 


1.  Derive  d’Alembert’s  formula  (2.14)  by  determining  the  two  arbitrary  func¬ 
tions  F  and  G  in  the  general  solution  (2.11)  using  the  initial  conditions 
(2.13). 


2.  Calculate  the  exact  solution  to  the  Cauchy  problem  when  c  —  2,  the  initial 
displacement  is  f(pc)  =  0,  and  the  initial  velocity  is  g(x)  =  1/(1  +  0.25x2). 
Plot  the  solution  surface  and  discuss  the  effect  of  giving  a  string  at  rest  an 
initial  impulse.  Contrast  the  solution  with  the  case  when  f  ^  0  and  g  =  0. 


3.  Solve  the  Cauchy  problem 


'U'tt  —  C  ^ xx  •> 


u(x,  0)  =  4>(x),  Ut(x,  0)  =  0, 


by  differentiating  the  solution  to  to  the  Cauchy  problem 

Utt  =  C2uxx,  u(x,  0)  =  0,  ut(x,  0)  =  4>(x). 


2.2  Cauchy  Problem  for  the  Wave  Equation 
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4.  Solve  the  outgoing  signal  problem 

Utt  ~  c2uxx  =  0,  x  >  0,  — oo  <  t  <  oo;  ux(0,t)  =  s(t ),  — oo  <  t  <  oo, 
where  s(t)  is  a  known  signal.  Hint:  Look  for  a  right-traveling  wave  solution. 

5.  The  three-dimensional  wave  equation  is 

utt  ~  c2Au  =  0, 

where  u  =  u(x,  y ,  z,  t)  and  A  is  the  Laplacian  operator.  For  waves  with 
spherical  symmetry,  u  =  u(p,  t),  where  p  =  x2  +  y2  +  z2.  In  this  special 
case  the  Laplacian  is  given  by  (Section  1.8)  An  =  upp  +  ^up.  By  introducing 
a  change  of  dependent  variable  U  =  pu,  show  that  the  general  solution  for 
the  spherically  symmetric  wave  equation 

2/  2  \ 

Utt  —  c  V^pp  T  ~^p) 

p 


1 

u  =  ~(F(p  —  ct )  +  G(p  +  ct)). 

P 

Why  do  you  think  an  outward-moving  wave  u  =  F(p  —  ct)/ p  decays  in 
amplitude?  Give  a  physical  interpretation. 

6.  Solve  the  Cauchy  problem 

Utt  -  C2uxx  =  0,  X  G  K,  t  >  0, 

u(x,0)  =  ut(x,  0)  =  cosx,  xGi 

Use  a  computer  algebra  program  to  graph  wave  profiles  at  t  =  1,2,3.  Take 
c—  1. 

7.  In  Section  1.7  we  showed  that  any  solution  to  Laplace’s  equation  has  the 
property  that  its  value  at  a  point  is  approximately  the  average  of  four 
nearby  values  surrounding  the  point.  Can  we  make  a  statement  about  solu¬ 
tions  to  the  wave  equation?  Consider  any  characteristic  parallelogram  (see 
Figure  2.5)  whose  sides  are  positive  and  negative  characteristics,  and  let 
A,  F),C,D  be  the  vertices  as  shown.  Show  that  any  solution  to  the  wave 
equation  satisfies  the  relation 

u(A)  +  u(C)  =  u(B)  +  u(D). 


8.  Let  u  =  u(x,t)  solve  the  wave  equation;  show  that  v  =  v(x,t)  defined  by 


v(x,  t )  = 


c 


*oo 


u(s,  x)e  s  c  /(4kt^ds. 


y/Airkt  J - 


oo 


solves  the  heat  equation. 
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Figure  2.5  Characteristic  parallelogram  ABCD 


9.  Using  the  hint  below  and  verifying  the  steps,  find  a  particular  solution  to 
the  nonhomogeneous  wave  equation 

Utt  ~  c2uxx  =  f(x,  £),  x  G  M,  t  >  0;  u(x,  0)  =  Ut(x,  0)  =  0,  xGl. 

Hint:  Integrate  the  equation  over  the  characteristic  triangle  T  with  corners 
(f  -  cr,  0),  (f  +  cr,  0),  (f ,  r)  to  get 

J  j  f  dxdt  =  J  {utt  —  c2uxx)  dxdt 

=  /  —c2ux  dt  —  ut  dx  (by  Green’s  Theorem) 

Jc 

=  2  cu(£,t). 

This  gives  the  solution  at  any  point  (£,  r).  Here,  C  is  the  boundary  of  T. 


2.3  Well-Posed  Problems 

In  the  last  two  sections  we  solved  the  Cauchy  problem,  or  pure  initial  value 
problem,  for  the  heat  equation  and  for  the  wave  equation.  Now  let  us  set  up 
a  similar  problem  for  Laplace’s  equation.  Immediately  the  reader  should  be 
skeptical  because  Laplace’s  equation  is  an  elliptic  equilibrium  equation  that 
does  not  involve  time;  we  expect  boundary  conditions  rather  than  initial  con¬ 
ditions.  Nevertheless,  let  us  consider  the  two-dimensional  Laplace’s  equation  in 
the  upper-half  plane  y  >  0  and  prescribe  u  and  its  y  derivative  uy  along  y  =  0, 
or  the  x  axis.  That  is,  let  us  consider  the  problem 

^ xx  H-  ^ yy  —  x  G  M,  y  0, 

u(x,0)  =  f(x),  uy(x,0)  =  g(x),  xGl. 


(2.15) 

(2.16) 


2.3  Well-Posed  Problems 
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Notice  the  similarity  to  the  initial  value  problem  for  the  wave  equation  (2.12)- 
(2.13);  but  here  we  are  using  y  instead  of  £,  and  the  equation  is  elliptic  rather 
than  hyperbolic.  As  an  aside,  observe  that  (2.15)-(2.16)  is  neither  a  Dirichlet 
problem  nor  a  Neumann  problem  because  both  u  and  uy  are  specified  along 
the  boundary  y  —  0. 

Let  us  carefully  analyze  problem  (2.15)-(2.16)  in  a  special  case.  Suppose 
first  that  the  boundary  conditions  are 

f(x)  =  g{  x)  =  0. 

Then  it  is  clear  that  a  solution  (the  only  solution)  to  (2.15)-(2.16)  is  the  zero 
solution  u(x,  y)  =  0  for  all  x  E  R,  y  >  0.  Now  let  us  change  the  conditions 
along  the  boundary  by  taking 

1 

fix)  =  —  cos  nay  g(x)  =0,  igM. 
n 

Then,  as  one  can  easily  check,  the  solution  to  (2.15)-(2.16)  is 

u(x,y)  =  —  cos  nx  cosh  ny. 
n 

Suppose  n  is  large  (say  n  =  100);  then  we  have  changed  the  boundary  condition 
u(x,  0)  =  f(x)  by  only  a  small  amount.  Yet  the  solution  has  changed  from  zero 
by  a  large  amount!  For  example,  along  the  line  x  =  0  (y  axis)  the  solution  is 

.  cosh  ny 

n(0,  y)  = - >  Too  as  y  oo. 

n 

To  summarize,  in  this  problem  a  small  change  of  the  data  on  the  boundary 
produced  a  large  change  in  the  solution.  This  behavior  is  disturbing,  because 
in  a  physical  problem  we  expect  that  the  solution  should  depend  continuously 
on  the  boundary  data — a  small  change  in  the  boundary  data  should  produce  a 
small  change  in  the  solution.  After  all,  we  want  to  be  confident  in  the  accuracy  of 
our  solution  even  if  we  get  the  boundary  data  only  approximately  correct.  This 
latter  property  of  continuous  dependence  on  data  is  called  stability,  and  the 
problem  (2.15)-(2.16)  does  not  have  it.  So  the  Cauchy  problem  for  Laplace’s 
equations  (2.15)-(2.16)  is  not  a  well-posed  physical  problem.  In  fact,  since 
Laplace’s  equation  models  steady  heat  flow,  it  seems  physically  reasonable  that 
we  need  only  specify  the  temperature  u(x,  0)  along  the  boundary  to  be  able  to 
solve  the  problem,  or  specify  the  flux  uy(x,  0),  but  not  both  as  in  (2.16). 

The  term  well-posed  in  PDEs  has  a  technical  meaning.  We  say  that  a 
boundary  value  problem,  initial  value  problem,  or  an  initial  boundary  value 
problem  is  well-posed  if: 
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(1)  (Existence)  it  has  a  solution 

(2)  (Uniqueness)  the  solution  is  unique 

(3)  (Stability)  the  solution  depends  continuously  on  the  initial  and/or  bound¬ 
ary  data. 

If  a  problem  is  not  well-posed,  then  it  is  called  ill-posed.  Resolving  these  three 
issues  for  various  PDE  models  occupies  much  of  the  theory  of  PDEs. 

Now  that  we  have  shown  that  the  Cauchy  problem  for  Laplace’s  equation 
is  not  stable,  the  reader  may  be  skeptical  about  other  problems  such  as  the 
Cauchy  problem  for  the  heat  equation.  We  can  easily  observe  that  solutions  to 
this  problem  have  the  stability  property. 


Example  2.3 

Consider  the  two  problems 


and 


ut 

u(x ,  0) 


t  0, 

</>(#),  xGl, 


vt 

v(x,  0) 


kvxx ,  x  e  M,  t  >  0, 
^(x),  xGl, 


where  </>  and  ip  are  continuous,  bounded  functions  and  close  in  the  sense  that 
\4>(x)  —  pj(x)\  <  8  for  all  x,  where  8  is  a  small  number.  We  would  like  to 
show  that  the  corresponding  solutions  u(x,t )  and  v(x,t)  are  close.  We  define 
w(x,t )  =  u(x,t )  —  v(x,t)  and  note  that  w  satisfies  the  Cauchy  problem 


wt  =  kwxx ,  x  e  >  0, 
w(x,  0)  =  (p{x)  —  ^(x),  xGl. 

The  solution  formula  for  the  Cauchy  problem  gives 

•  oo 


/CXJ 

Wy)  -  ip(y))G(x  -  y,  t)dy, 

-oo 

where  G(x,  t)  is  the  fundamental  solution.  Therefore,  for  each  t  >  0, 

/oo 

I  4>{y)  -  V’(y)  II  G(x  -  y,t)  I  dy 

-oo 


—  oo 
•  oo 


< 


/oo 

SG(x  —  y ,  t)dy  =  S, 

-oo 
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since  f  G(x  —  y,t)dy  =  1.  Therefore,  in  the  sense  interpreted  above,  closeness 
of  the  initial  data  implies  closeness  of  the  solution.  □ 


EXERCISES 

1.  Show  that  the  Cauchy  problem  for  the  backward  diffusion  equation, 

Uf  T  Uxx  —  0?  x  G  M,  t  0, 
u(x,  0)  =  /(#),  x  G  R, 

is  unstable  by  considering  the  solutions 

/  \  1  „  2  i 

u(x,  t)  =  1  H — e  sinnx 

n 

for  large  n.  Hint:  Follow  the  method  used  in  this  section  for  Laplace’s 
equation. 

2.  Let  u  =  u(x,  y).  Is  the  problem 

uxy  =  0,  0  <  x,y  <  1, 

on  the  unit  square,  where  the  value  of  u  is  prescribed  on  the  boundary  of 
the  square,  a  well-posed  problem?  Discuss  fully. 

3.  Consider  the  two  Cauchy  problems  for  the  wave  equation  with  different 
initial  data: 


ultt  =  c2ulxx ,  xGl,  0  <  t  <  T, 
id(ay0 )  = /z(x),  u\(x,  0)  =  gl(x),  xGl, 

for  %  =  1,2,  where  Z1,/2,^1,  and  g 2  are  given  functions  (the  superscripts 
are  indices  and  not  exponents).  If  for  all  x  G  R  we  have 


Z1^)  -  f2(x )  |<  <5i, 


show  that  |  rz1(x,t)  —  u2(x,t)  \  <  5i  +  for  all  x  G  R,  0  <  t  <  T. 
What  does  this  mean  with  regard  to  stability?  Hint:  Follow  the  method  in 
Example  2.3. 
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2.4  Semi-Infinite  Domains 

Heat  Equation 

In  Sections  2.1  and  2.2  we  solved  the  heat  equation  and  the  wave  equation, 
respectively,  on  the  domain  —  oo  <  x  <  oo.  Now  we  study  these  equations 
when  the  domain  is  semi-infinite,  i.e.,  on  the  interval  0  <  x  <  oo.  This  means 
that  there  is  a  boundary  in  the  problem,  at  x  =  0,  and  one  expects  that  it 
is  necessary  to  impose  a  boundary  condition  there.  For  example,  to  determine 
how  the  temperature  distribution  evolves  in  a  semi-infinite  bar,  one  should 
know  the  temperature  in  the  bar  initially,  as  well  as  the  temperature  at  x  =  0 
for  all  time. 

To  begin,  we  consider  the  initial  boundary  value  problem  for  the  heat  equa¬ 
tion 


ut  =  kuxx ,  £>0,t>0,  (2.17) 

i^(0,  t)  =  0,  t>  0,  (2.18) 

u(x,  0)  =  4>(x),  x  >  0,  (2.19) 

where  we  specify  the  temperature  to  be  zero  at  x  =  0  for  all  time.  To  solve 
this  problem  we  use  the  method  of  reflection  through  the  boundary.  The  idea 
is  to  extend  the  problem  (2.17)-(2.19)  to  the  entire  real  axis  by  extending  the 
initial  data  </>  to  an  odd  function  defined  by 

#t)  =  (  t{,x)\  “V' 

\  —  </>(— x),  x  <  0, 

with  -0(0)  =0.  We  then  solve  the  extended  problem  by  formula  (2.8)  and  then 
restrict  that  solution  to  the  positive  real  axis,  which  will  then  be  the  solution 
to  (2.17)-(2.19).  Figure  2.6  shows  the  initial  data  for  the  extended  problem 
and  a  resulting  odd  solution  profile  v(xfl).  Physically,  we  are  attaching  a  bar 
occupying  the  space  —  oo  <  x  <  0  and  giving  it  an  initial  temperature  equal 
to  the  negative  of  that  in  the  original  bar.  Thus,  let  us  consider  the  Cauchy 
problem  for  v(x,t): 


vt  =  kvxx ,  x  e  R,t  >  0, 
v(x,  0)  =  ^(x),  x  G  R, 


(2.20) 

(2.21) 


where  is  the  odd  extension  of  the  function  <fi  as  described  above.  By  the 
formula  for  the  solution  to  the  heat  equation  over  the  real  line,  the  solution  to 
(2.20)-(2.21)  is  given  by 


•oo 


G(x 


y,t)ip(y)dy, 


v(x,  t ) 


—  oo 
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Figure  2.6  The  initial  data  ^(x)  and  solution  profile  v(x,t)  for  the  odd, 
extended  problem 


where  G(x,  y )  is  the  fundamental  solution.  Breaking  this  integral  into  two  parts, 
y  <  0  and  y  >  0,  we  obtain 


v(x,  t)  = 


0  poo 

G(x-y,t)ip(y)dy+  G(x  -  y,t)ip(y)dy 

J  0 


-oo 


■  oo 
*oo 


>0  poo 

G(x-y,t)<p(-y)dy+  G(x  -  y,t)<f>(y)dy 

Jo 

p  oo 

G(x  +  y,t)<j>(y)dy+  /  G(x  -  y,t)<f>(y)dy 

0  Jo 

•oo 

[G(x  —  y,t)  —  G(x  +  y,  t)]<f>(y)dy. 


0 


We  restrict  this  solution  to  x  >  0,  and  therefore  the  solution  to  the  heat 
equations  (2.17)-(2.19),  on  the  domain  x  >  0,  is 

p  oo 

u(x,t)  =  /  [G(x  —  y,t)  —  G(x  +  y,t)]cj)(y)dy,  x>0. 

Jo 


The  Wave  Equation 

The  wave  equation  on  a  semi-infinite  domain  can  be  solved  in  the  same 
manner.  Consider  the  problem  of  the  transverse  vibrations  of  a  string  occupying 
x  >  0  when  the  end  at  x  =  0  is  held  fixed.  The  initial  boundary  value  problem 

Utt  —  C  U xxi  x  0,  t  0, 
u( 0,  t)  =  0,  t  >  0, 

u(x,  0)  = /(x),  ut(x,  0)  =  g(x),  x  >  0. 


is 


(2.22) 

(2.23) 

(2.24) 
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For  x  >  ct  (i.e.,  ahead  of  the  leading  signal  x  =  ct  coming  from  the  origin)  the 
interval  of  dependence  lies  in  (0,  oo),  where  the  initial  data  are  given;  therefore, 
in  this  domain,  the  solution  is  given  by  d’Alembert’s  formula: 

1  1  rx+ct 

u(x,t)  =  —(f(x  —  ct)  +  f{pc  +  ct))  4 - I  g(s)ds ,  x  >  ct.  (2.25) 

2  2c  Jx-Ct 

Next,  the  data  given  along  the  x  =  0  boundary  cannot  affect  the  solution  in 
the  region  x  >  ct,  since  signals  travel  outward  from  the  boundary  at  speed  c. 
See  Figure  2.7  To  solve  the  problem  in  the  region  0  <  x  <  ct  we  proceed  as  we 


Figure  2.7  Space-time  domain  x,t  >  0  where  the  problem  (2.22)-(2.24)  is 
defined.  The  region  x  >  ct  is  affected  only  by  the  initial  data  /  and  g  and  can 
be  found  by  d’Alembert’s  formula 


did  for  the  heat  equation  and  extend  the  initial  data  /  and  g  to  odd  functions 
on  the  entire  real  axis.  Therefore,  we  consider  the  problem 

vtt  =  c2vxx,  xeR,t>0,  (2.26) 

c(0,t)  =  0,  t  >  0,  (2.27) 

v(x,  0)  =  F(x),  Vt(x,0)  =  G(x),  xGl,  (2.28) 

where 

F(x)  =  f(x),  x  >  0;  F( 0)  =  0;  F(x)  =  — /(— x),  x  <  0, 

and 

G(x)  =  g(x),  x  >  0;  G(0)  =  0;  G(x)  =  —g(—x),  x  <  0. 

By  d’Alembert’s  formula  the  solution  to  (2.26)-(2.28)  is 

1  1  rx+ct 

v(x,  t)  =  —(F(x  —  ct)  +  F(x  +  ct))  H - /  G(s)ds. 

2  2c  Jx_ct 
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In  the  region  x  <  ct  this  becomes  (since  x  —  ct  <  0) 


1  1  r 

v(x,  t)  =  -(-/(-z  +  ct)  +  f(x  +  ct))  +  —  /  —g(—s)ds 

2  2c  Jx_ct 


+ 


1 


•cc+ct 


g(s)ds. 


Jo 

If  the  variable  of  integration  s  in  the  first  integral  is  replaced  by  —  s,  then  the 
two  integrals  can  be  combined,  and  we  may  write 

1  1  rct+x 

u(x,  t)  =  v(x,  t)  =  - (f(x  +  ct)  -  f(ct  -  x))  +  —  J  g(s)ds ,  (2.29) 


ct—x 


0  <  X  <  ct. 


In  summary,  the  solution  to  the  initial  boundary  value  problem  (2.22)-(2.24) 
for  the  wave  equation  on  a  half-line  is  given  by  the  two  formulas  (2.25)  and 
(2.29),  depending  on  x  >  ct  or  x  <  ct. 

Why  does  this  reflection  method  work  for  the  heat  equation  and  wave  equa¬ 
tion?  To  reiterate,  the  solutions  to  the  Cauchy  problems  for  these  two  equations 
are  odd  functions  if  the  initial  data  is  odd.  And  the  restriction  of  an  odd  solu¬ 
tion  to  the  positive  real  axis  is  the  solution  to  the  given  initial  boundary  value 
problem.  If  this  intuitive  reasoning  leaves  the  reader  perplexed,  then  one  can 
always  verify  analytically  that  the  solutions  we  have  obtained  by  this  reflection 
method  are,  in  fact,  solutions  to  the  given  problems. 

Finally,  if  the  boundary  condition  (2.18)  along  x  =  0  in  the  heat  flow 
problem  (2.17)-(2.19)  is  replaced  by  a  Neumann  condition 

ux(0,t)  =  0,  t  >  0, 

then  the  problem  can  be  solved  by  extending  the  initial  data  to  an  even  func¬ 
tion.  The  same  is  true  for  the  wave  equation.  We  leave  these  calculations  as 
exercises. 


EXERCISES 

1.  Solve  the  initial  boundary  value  problem  for  the  heat  equation, 

ut  =  kuxx ,  x  >  0,  t  >  0, 
ux(0,t)  =  0,  t  >  0, 
u(x,  0)  =  </>(#),  x  >  0, 

with  an  insulated  boundary  condition,  by  extending  the  initial  temperature 
(j)  to  the  entire  real  axis  as  an  even  function.  The  solution  is 

poo 

u(x,y)=  /  [G(x-y,t)  +  G(x  +  y,t)\<p(y)dy. 

Jo 
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2.  Find  a  formula  for  the  solution  to  the  problem 

Uf  —  kiLxx-)  x  >  0,  t  >  0, 
u( 0,  t)  =  0,  t  >  0, 
u(x,  0)  =  1,  x  >  0. 

Plot  several  solution  profiles  when  k  =  0.5. 

3.  Find  the  solution  to  the  problem 

lift  —  C  Vjxxi  %  ^  0,  t  0, 
u( 0,  t)  =  0,  t  >  0, 

^(x,  0)  =  xe~x ,  Ut(x,  0)  =  0,  x  >  0. 

Pick  c  =  0.5  and  sketch  several  time  snapshots  of  the  solution  surface  to 
observe  the  reflection  of  the  wave  from  the  boundary. 

4.  Solve  the  problem 

Uf  —  kuXx i  x  0,  t  0, 
u( 0,  t)  =  1,  t  >  0, 
u(x,  0)  =  0,  x  >  0. 

Hint:  Transform  the  problem  to  one  of  the  form  (2.17)-(2.19)  and  use 
Exercise  2. 

5.  (Age  of  the  earth)  In  this  exercise  we  use  Lord  Kelvin’s  argument,  given 
in  the  mid  1860s,  to  estimate  the  age  of  the  earth  using  a  measurement 
of  the  geothermal  gradient  at  the  surface.  The  geothermal  gradient  is  the 
temperature  gradient  ux  measured  at  surface  of  the  earth.  To  obtain  the 
estimate,  treat  the  earth  as  flat  at  the  surface  with  x  >  0  measuring  the 
depth  from  the  surface  x  =  0.  Take  the  diffusivity  of  the  earth  to  be 
k  =  0.007  cm2  per  second,  the  assume  that  initial  temperature  was  7000 
degrees  Fahrenheit  (molten  rock)  at  the  beginning.  Finally,  assume  the 
temperature  of  the  surrounding  atmosphere  has  always  been  0  degrees.  Use 
a  current  geothermal  gradient  value  of  3.7  x  10-4  degrees  per  cm.  After 
determining  the  approximate  age  of  the  earth,  estimate  the  percentage  of 
the  original  heat  that  has  been  lost  until  the  present  day?  Comment  on  the 
accuracy  of  the  Kelvin’s  argument  by  searching  on  line  for  the  approximate 
age  of  the  earth.  Hint:  Treat  this  as  a  linear,  one-dimensional,  heat  flow 
problem. 

6.  (Subsurface  temperatures)  A  clutch  of  insect  eggs  lies  at  a  depth  of  x\  cm 
below  the  ground  surface. 
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a)  If  the  surface  is  subjected  to  periodic  temperature  variations  of  To  + 
A  cos  cot  over  a  long  time,  what  is  the  temperature  variation  experi¬ 
enced  by  the  egg  clutch?  Hint:  find  a  complex,  plane  wave  solution  of 
the  diffusion  equation  of  the  form  u  =  To  +  Ael^,x~u)t^  and  determine 
7  in  terms  of  uj  and  the  diffusivity  k  of  the  soil;  take  the  real  part. 

b)  At  depth  x\  find  the  phase  shift  of  the  temperature  and  the  amplitude 
attenuation  factor. 

c)  Plot  of  the  amplitude  of  the  temperature  variation  versus  depth.  Take 
To  =  30,  A  =  15  degrees  Celsius,  k  =  0.004  cm2  per  second,  and  lu  =  2tt 
per  day,  and  plot  the  temperature  variations  at  the  surface  and  at  the 
depth  3  cm. 

7.  The  heat  equation  governs  the  flow  of  heat  downward  (x  >  0)  through  the 
soil  due  to  changes  in  ambient  air  temperature  S(t)  at  the  surface  ( x  =  0) 
and  the  absorption  of  heat  caused  by  solar  radiation  W (t)  falling  on  the 
surface,  measured  in  Watts  per  unit  area  A,  per  time.  Let  u  =  u(x ,  t) 
denote  the  temperature  at  a  depth  x  at  time,  and  let  C,  p,  and  K  be 
the  specific  heat,  density,  and  thermal  conductivity  of  the  soil.  Also,  let  H 
denote  the  heat  transfer  coefficient  at  the  surface  and  a  be  the  fraction  of 
solar  energy  that  is  absorbed  at  the  surface.  Derive  the  boundary  condition 
at  the  surface, 


-AKux{ 0,  t)  =  -AH(u( 0,  t)  -  S(t ))  +  AaW (t), 

being  specific  about  the  origin  of  each  term.  Give  a  correct  set  of  units  for 
each  quantity  appearing.  (For  an  example  see  A.  Parrott  &  J.  D.  Logan, 
2010,  Ecological  Modelling  221,  1378-1393.) 


2.5  Sources  and  Duhamel’s  Principle 

How  do  we  proceed  if  the  PDE  contains  a  source  term?  For  example,  consider 
the  heat-flow  problem 


ut  =  kuxx  +  f(x,t),  xeR,t>0,  (2.30) 

u(x,  0)  =  0,  xGl,  (2.31) 

where  /  is  a  given  heat  source.  The  key  to  the  analysis  of  this  problem  can  be 
discovered  by  examining  an  ordinary  differential  equation  with  a  source  term. 
For  example,  consider  the  initial  value  problem 


y'(t)  +  ay  =  F(t),  t  >  0;  y(0)  =  0. 


(2.32) 
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Multiplying  by  the  integrating  factor  eat  makes  the  left  side  a  total  derivative, 
and  we  obtain 

±(eaty)=eatF(t). 

Integrating  from  0  to  t  then  gives 

eaty(t )  =  f  ear F(r)dr, 

Jo 


which  can  be  rewritten  as 


y(t)  =  [  e-a(t-T)F(T)dT.  (2.33) 

Jo 

Now  let  us  consider  another  problem,  where  we  put  the  source  term  in  as  the 
initial  condition.  Let  w  =  w(t;  r)  be  the  solution  to  the  problem 

w'(t]  r)  +  aw(t]  r)  =  0,  t  >  0;  w(0;r)  =  F(r), 


where  a  new  parameter  r  has  been  introduced.  It  is  straightforward  to  see  that 
the  solution  to  this  problem  is 

w(t;  t)  =  F(r)e~at. 

So  the  solution  to  (2.32),  the  problem  with  a  source,  is  the  integral  of  the  solu¬ 
tion  w(t,  t)  (with  t  replaced  by  t  —  r)  of  the  associated  homogeneous  problem 
where  the  source  is  included  as  an  initial  condition.  That  is, 

y(t)  =  f  w(t  —  r;r)F(r)  t  =  f  F(r)e_a^_T^  dr, 

Jo  Jo 

which  is  (2.33). 

The  fact  that  a  particular  solution  of  a  linear  equation  can  be  deduced 
from  the  solution  of  the  homogeneous  equation  is  called  Duhamel’s  principle. 
For  ODEs  we  state  the  principle  as  follows: 


Theorem  2.4 

(Duhamel’s  principle)  The  solution  of  the  problem 


y’(t)  +  ay  =  F(t),  t  >  0;  y( 0)  =  0 


is  given  by 


V(t)  =  /  W(t-T,  T)dT, 
JO 

where  w  =  w(t,r)  solves  the  homogeneous  problem 


w'(t;  t)  +  aw(t]  t)  =  0,  t  >  0;  w(0;r)  =  F(r).  □ 
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The  same  type  of  result  is  true  for  second-order  ODEs  as  well;  the  reader 
may  recall  that  the  variation  of  parameters  method  uses  the  homogeneous 
solutions  to  construct  a  particular  solution.  See  the  Appendix  for  the  formula. 
Now  let  us  extrapolate  this  idea  and  apply  it  to  the  heat  flow  problem 

(2.30) -(2.31).  If  Duhamel’s  principle  is  valid  in  this  case,  then  the  solution  of 

(2.30) -(2.31)  should  be 

u(x,t)  =  /  w(x,  t  —  r,  r)dr, 

J  o 

where  w(x,t;r)  solves  the  homogeneous  problem 

wt  =  kwXXl  x  e  ,t>  0,  (2.34) 

w(x,  0;  r)  =  /(x,  r),  xgR.  (2.35) 


In  fact,  we  can  write  down  the  explicit  formula;  by  (2.8)  the  solution  to  (2.34)- 
(2.35)  is 


w(x,  t ;  t) 


-oo 


y,t)f(y,r)dy, 


where  G  is  the  heat  kernel.  Therefore,  the  solution  to  (2.30)-(2.31)  should  be 
given  by 

pt  pOO 

u(x,t)=  /  G{x-y,t-T)f(y,T)dydT. 

J  0  J  —  oo 

Indeed,  one  can  verify  that  this  is  the  case. 

It  is  not  surprising  that  the  solution  turned  out  to  be  an  integral.  The  PDE 

(2.30)  has  the  form  Hu  =  /,  where  H  =  —  fcjp-  is  a  differential  operator. 

If  we  formally  write  u  =  H~x  f  (as  we  might  do  in  matrix  theory  if  H  were  a 
matrix  and  u  and  /  vectors),  then  we  would  expect  if-1,  the  inverse  of  i4,  to  be 
an  integral  operator,  since  integration  and  differentiation  are  inverse  processes. 
We  may  now  write  down  the  formula  for  the  solution  to  the  problem 


ut  =  kuxx  +  /(x,  £),  x  e  M,  t  >  0,  (2.36) 

u(x,  0)  =  </>(x),  x  G  R,  (2.37) 

where  the  initial  condition  is  no  longer  zero.  By  linearity,  the  solution  to  (2.36)- 
(2.37)  is  the  sum  of  the  solutions  to  the  two  problems 


'Ut  —  kuXXl  x  £  M,  t  0, 
u(x,  0)  =  </>(x),  x  £  R, 


ut  =  kuxx  +  f(x,t),  x  £  M,  t  >  0, 
u(x,  0)  =  0,  x  £  R. 


and 
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Thus,  the  solution  to  (2.36)-(2.37)  is 


/OO  nt  rOO 

G{x  -  y,t)<p(y)dy  +  /  /  G(x  -  y,t  -  T)f(y,r)dydT.  (2.38) 

-oo  J  ()  J  —  oc 

This  is  the  variation  of  parameters  formula  for  the  problem  (2.36)-(2.37). 


Example  2.5 

(Wave  equation)  Duhamel’s  principle  can  also  be  applied  to  the  wave  equa¬ 
tion.  The  solution  to  the  problem 


utt  -  C2uxx  =  f(x,t),  xeR,t>0, 
u(x ,  0)  =  Ut(x,  0)  =  0,  x  G  R, 


(2.39) 

(2.40) 


is 


u(x,t)  =  /  w(x,  t  —  r,  r)dr, 

Jo 

where  w  =  w(x,  t;  r)  is  the  solution  to 

-  c2^  =  0,  ieK,f>o, 
re(x,0;r)  =  0,  Wt(x,  0;  r)  =  /(x,  r),  xGl. 

We  put  the  source  term  /  into  the  initial  condition  for  wt  rather  than  for 
w  because  /,  like  wt,  is  an  acceleration;  note  that  the  displacement  u  is  a 
time  integral  of  w,  so  w  must  be  a  velocity,  making  Wt  an  acceleration.  From 
d’Alembert’s  formula, 

2  px+ct 

w{x,  t;r)  =  —  f(s,T)ds, 

^  J  x  —  ct 

and  therefore  by  Duhamel’s  principle  the  solution  to  (2.39)-(2.40)  is  given  by 
the  formula 

1  nt  pX  +  c(t  —  T) 

u(x,t)  =  —  /  /  f(s,r)dsdr.  □  (2.41) 

2C  J0  Jx  —  c(t—r ) 

Source  terms  also  arise  in  PDEs  when  problems  are  transformed  in  order 
to  homogenize  the  boundary  conditions. 


Example  2.6 

Consider  the  diffusion  problem 

u(x,  0) 
u(0,  t) 


kuxx,  x>0,t>0. 
4>(x),  x  >  0, 
g(t),  t  >  0. 


(2.42) 

(2.43) 

(2.44) 
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We  solved  this  problem  in  the  last  section  when  g(t)  =0.  So  let  us  attempt  to 
transform  the  problem  into  one  where  the  boundary  condition  is  zero.  To  this 
end,  let  x(x,  t )  =  u(x,  t)  —  g(t ),  or  u(x,  t)  =  x(x,  t)  +  g(t);  then  substituting  into 
(2.42)-(2.44)  gives 

vt  =  kvxx  -  g'(t ),  x  >  0,  t  >  0,  (2.45) 

x(x,  0)  =  'ip(x)  =  (j){x)  —  g(0),  x  >  0,  (2.46) 

x(0,t)  =  0,  £>0.  (2.47) 

Therefore,  transformation  of  the  dependent  variable  has  changed  the  problem 
into  one  with  a  homogeneous  boundary  condition,  but  a  price  was  paid — an 
inhomogeneity,  or  source  term  —  g'(t),  was  introduced  into  the  PDE.  In  general, 
we  can  always  homogenize  the  boundary  conditions  in  a  linear  problem,  but 
the  result  is  an  inhomogeneous  PDE;  so  inhomogeneous  boundary  conditions 
can  be  traded  for  inhomogeneous  PDEs.  We  can  solve  (2.45)-(2.47)  for  v(x,t) 
by  formulating  a  Duhamel’s  principle.  In  Section  2.6  we  show  that  Laplace 
transform  methods  can  also  be  applied  to  find  the  solution.  □ 


EXERCISES 

1.  Write  a  formula  for  the  solution  to  the  problem 

9 

utt  —  c  uxx  =  sinx,  x  G  M,  t  >  0, 
u(x,  0)  =  Ut(x,  0)  =  0,  x  G  R. 

Graph  the  solution  surface  when  c—  1. 

2.  Write  a  formula  for  the  solution  to  the  problem 

ut  —  kuxx  =  sinx,  x  E  R,  t  >  0, 
u(x,  0)  =  0,  x  G  R. 

3.  Using  Duhamel’s  principle,  find  a  formula  for  the  solution  to  the  initial 
value  problem  for  the  convection  equation 

ut  +  cux  =  /(x,  £),  x  G  R,  t  >  0;  u(x,  0)  =  0,  xGl. 

Hint:  Look  at  the  problem 

Wt(x,  £;  r)  +  cwx(x,  t\ r)  =  0,  x  G  ,t  >  0;  w(x,  0;  r)  =  /(x,  r),  xGl. 
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4.  Solve  the  problem 

ut  +  2 ux  =  xe~l ,  x  G  R,  £  >  0;  u(x,  0)  =  0,  xGl. 


5.  Formulate  Duhamel’s  principle  and  solve  the  initial  boundary  value  prob¬ 
lem 


Solution: 

u(x,  t) 


Ut  =  kuxx  +  f(x,t),  x  >  0,t  >  0, 
u( x,  0)  =  0,  x  >  0, 
w(0,  t)  =  0,  t  >  0. 


V>t  —  t)  —  G(x  +  y,t  —  r))f(y,  r)dyd,T. 


2.6  Laplace  Transforms 


Laplace  transforms  are  first  encountered  in  elementary  differential  equations 
courses  as  a  technique  for  solving  linear,  constant-coefficient,  ordinary  differ¬ 
ential  equations;  Laplace  transforms  convert  an  ODE  into  an  algebra  problem. 
They  are  particularly  useful  for  nonhomogeneous  differential  equations  with 
impulse  or  discontinuous  sources.  The  ideas  easily  extend  to  PDEs,  where  the 
the  operation  of  Laplace  transformation  converts  PDEs  to  ODEs. 

Let  u  =  u(t)  be  a  piecewise  continuous  function  on  t  >  0  that  does  not 
grow  too  fast;  for  example,  assume  that  u  is  of  exponential  order ,  which  means 
that  \u(t)\  <  cexp(at)  for  t  sufficiently  large,  where  a,c  >  0.  Then  the  Laplace 
transform  of  u  is  defined  by 


POO 

(. Cu)(s )  =  U(s)=  /  u(t)e 

J  0 


—  st 


dt. 


(2.48) 


The  Laplace  transform  is  an  example  of  an  integral  transform;  it  takes  a  given 
function  u(t)  in  the  time  domain  and  converts  it  to  a  new  function  U (s),  in  the 
so-called  transform  domain.  U  and  s  are  called  the  transform  variables.  The 
Laplace  transform  is  linear  in  that 

C(c\u  +  C2V)  =  c\Cu  +  C2C  v 

where  c\  and  C2  are  constants.  If  the  transform  U (s)  is  known,  then  u(t)  is 
called  the  inverse  transform  of  U(s)  and  we  write  jC~1U  =  u.  Pairs  of  Laplace 
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transforms  and  their  inverses  are  tabulated  in  books  of  tables,  and  many  soft¬ 
ware  packages,  such  as  MATLAB,  have  simple  commands  that  yield  transform 
pairs.  A  short  table  is  given  at  the  end  of  this  section  (Table  2.1). 

The  importance  of  the  Laplace  transform,  like  other  transforms,  is  that 
it  changes  derivative  operations  to  multiplication  operations  in  the  transform 
domain.  In  fact,  we  have 

(Cu')(s)  =sU(s)-u(  0),  (2.49) 

(. Cu")(s )  =s2U(s)  —  s'u(O)  —  i/(0).  (2.50) 

Formulas  (2.49)  and  (2.50)  are  readily  derived  using  integration  by  parts,  and 
they  are  the  basic  operational  formulas  for  solving  differential  equations. 


Example  2.7 

Solve  the  initial  value  problem 

u"  +  u  =  0,  t  >  0;  tx(0)  =  0,  v!( 0)  =  1. 


Taking  Laplace  transforms  of  both  sides  of  the  differential  equation  and  using 
(2.50)  gives 

s2U(s)  -  1  +  J7(s)  =  0, 


or 


U(s)  = 


1 


1  +  s2' 

So,  the  ordinary  differential  equation  has  been  transformed  into  an  algebraic 
equation,  and  we  solved  the  problem  in  the  transform  domain.  To  recover  u(t) 
from  its  transform  U(s)  we  look  up  the  inverse  transform  in  Table  2.1  to  find 


u(t)  =  C 


1 


1  +  s‘ 


—  sin  t, 


which  is  the  solution.  □ 


In  the  preceding  example  we  were  recovered  a  function  from  its  Laplace 
transform  by  looking  in  a  table.  One  may  ask,  in  general,  how  to  determine 
u(t)  from  knowledge  of  its  transform  U ( s ).  The  answer  to  this  question  requires 
knowledge  of  complex  variables;  it  would  take  us  far  afield  to  give  a  thorough 
discussion.  However,  we  do  indicate  the  general  formula  to  compute  u(t)  from 
its  transform  U(s).  The  inversion  formula  is 


u(t)  =  (£_1f/)(i) 


1 


•  a+ioo 


U  (s)estds. 


2iri 


a— ioc 
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The  integral  in  this  formula  is  a  complex  contour  integral  taken  over  the  vertical 
straight  line  in  the  complex  plane  from  a  — ioc  to  a  +  zoo.  The  number  a  is  any 
real  number  for  which  the  resulting  path  lies  to  the  right  of  any  singularities 
(poles,  essential  singular  points,  or  branch  points  and  cuts)  of  the  function  U ( s ). 
A  thorough  discussion  of  the  inversion  formula  can  be  found  in  the  references  . 

Another  important  result  that  is  extremely  useful  in  calculations  is  the 
convolution  theorem. 

Theorem  2.8 

(Convolution  Theorem)  Let  u  and  v  be  piecewise  continuous  on  t  >  0  and 
of  exponential  order.  Then 


£(u*v)(s)  =  U(s)V(s), 


where 


(u*v)(t)=  /  u(t  —  r)v(r)dr 

Jo 


is  the  convolution  of  u  and  v,  and  U  =  Cu ,  V  =  Cv.  Therefore, 


£-1(U(s)V(s))  =  (u*v)(t).  □ 


The  convolution  theorem  tells  what  to  take  the  transform  of  in  order  to 
get  a  product  of  Laplace  transforms,  namely,  the  convolution.  It  also  states 
that  the  inverse  transform  of  a  product  of  two  transforms  is  the  convolution 
of  the  two.  It  is  easy  to  verify  that  (u  *  v)(t)  =  (v  *  u)(t ),  so  the  order  of  the 
convolution  does  not  make  a  difference.  We  know  that  the  Laplace  transform  is 
additive,  but  this  shows  it  is  not  multiplicative.  That  is,  the  Laplace  transform 
of  a  product  is  not  the  product  of  the  Laplace  transforms. 


Example  2.9 

Find  the  solution  of  the  IVP 

u  +  3u  =  /(t),  'u(O)  =  1. 

Taking  the  transform  of  the  equation  gives 

sU(s)  -  1  +  3 U(s)  =  F(s ),  F(s)  =  Cf{t), 


or 


1 


+ 


1 


s  T  3  s  T  3 


F(s). 
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Taking  the  inverse  transform  yields 

u(t)  =  e~3t  +  /r 1  . 

The  convolution  theorem  implies  that  the  second  term  on  the  right  is 

e~3t  *  f(t). 


Therefore 


+ 


e“3  (*-T)/(r)dr. 


□ 


To  review  the  preceding  method,  taking  the  Laplace  transform  of  an  ordi¬ 
nary  differential  equation  for  u(t)  results  in  an  algebraic  equation  for  U (s) 
in  the  transform  domain.  We  solve  the  algebraic  equation  for  U ( s )  and  then 
recover  u(t)  by  inversion. 

Partial  Differential  Equations 

The  same  strategy  applies  to  partial  differential  equations  where  the  unknown 
is  a  function  of  two  variables,  for  example  u  =  u(x,  t).  Now  we  transform  on  t, 
as  before,  with  the  variable  x  being  a  parameter  unaffected  by  the  transform. 
In  particular,  we  define  the  Laplace  transform  of  u(x,  t)  by 

POO 

(Cu)(x,s)  =  U(x,s)  =  /  u(x,t)e~stdt.  (2.51) 

Jo 

Then  time  derivatives  transform  as  in  (2.49)  and  (2.50);  for  example, 

(. Cut)(x ,  s )  =  sU (x,  s )  —  u(x,  0),  ( Cuu)(x ,  s)  =  s2U (x,  s)  —  su(x ,  0)  —  ut(x,  0). 


On  the  other  hand,  spatial  derivatives  are  left  unaffected;  for  example, 

roo  &  _  d  f°° 

(. Cux)(x,s)=  /  —u(x,t)e  stdt=—~  /  u(x,t)e  stdt  =  Ux(x,  s). 

J o  Vx  Jo 

Therefore,  upon  taking  Laplace  transforms,  a  PDE  in  x  and  t  is  reduced  to  an 
ordinary  differential  equation  in  x\  all  the  t-derivatives  are  turned  into  mul¬ 
tiplication  in  the  transform  domain.  We  point  out  that  in  some  problems  the 
transform  may  be  take  on  x,  with  t  as  a  parameter.  The  choice  depends  on  the 
type  of  boundary  conditions. 


Example  2.10 

(Contaminant  transport)  Let  u  =  u(x,  t )  denote  the  concentration  of  a 
chemical  contaminant  dissolved  in  a  fluid  in  the  semi-infinite  domain  x  >  0 
at  time  t.  Initially,  assume  that  the  domain  is  free  from  contamination.  For 
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times  t  >  0  we  impose  a  constant  unit  concentration  of  a  contaminant  on  the 
boundary  x  =  0,  and  we  ask  how  this  contaminant  diffuses  into  the  region. 
Assuming  a  unit  diffusion  constant,  the  mathematical  model  is 

Ut  uXx  —  0?  x  0,  t  0, 
u(x,  0)  =  0,  x  >  0, 
u(0,t)  =  1,  t  >  0;  u(x,t)  bounded. 

Taking  Laplace  transforms  of  both  sides  of  the  PDE  gives 

sU(x,s)  -  Uxx(x,s)  =  0. 

This  is  an  ordinary  differential  equation  with  x  as  the  independent  variable, 
and  the  solution  is 


U (x,  s)  =  a(s)e~^x  +  b(s)e^x . 
Because  we  want  bounded  solutions,  we  set  b(s)  =  0.  Then 

U (x,  s)  =  a(s)e~v^x. 


Now  we  take  the  Laplace  transform  of  the  boundary  condition  to  get  U (0,  s)  = 
1/s,  where  we  have  used  C{  1)  =  1/s.  Therefore,  a(s)  =  1/s,  and  the  solution 
in  the  transform  domain  is 


U (x,  s)  = 


Now  we  must  invert  the  transform.  Consulting  Table  2.1  or  a  computer  algebra 
program,  we  find  that  the  solution  is 


u(x,  t)  =  erfc 


x 


where  erfc  is  the  complimentary  error  function  defined  by  the  formula 


Observe  that 


erfc  (y)  =  1 


dr. 


erfc  (y)  =  1  —  erf  (y). 


The  complimentary  error  function  is  plotted  in  Figure  2.8.  □ 
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Figure  2.8  Plot  of  the  complimentary  error  function  erfc(y).  In  the  solution 

y  =  x/VM 


Example  2.11 

In  the  contaminant  transport  model  in  the  last  example  let  us  change  the 
boundary  condition  to  a  function  of  time  and  consider 


'Ut  ^ xx  —  0?  t  >  0, 

u{x,  0)  =  0,  x  >  0, 

u(0,t)  =  f(t),  t  >  0;  u(x,t)  bounded. 

Taking  Laplace  transforms  of  both  sides  of  the  PDE  gives,  as  in  the  example, 

sU (x,  s)  -  Uxx(x,  s)  =  0, 


which  has  solution 

U(x,s)  =  a(s)e~^x . 

Now  we  take  a  Laplace  transform  of  the  boundary  condition  to  get  17(0,5)  = 
F(s).  Therefore,  a(s)  =  F(s)  and  the  solution  in  the  transform  domain  is 


U(x,s)  =  F(s)e~^x. 


Consulting  Table  2.1,  we  find  that 


c-i  L-v~sA  = 


X 


V  47 rt3 


,  —  x2 /At 


Therefore,  we  can  use  the  convolution  theorem  to  write  the  solution  as 


u(x,  t)  = 

J  o 


X 


o  \/4n(t  —  r)3 


e-x2/Wt-T))f(T)dT.  □ 
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A  very  complete  reference  for  the  Laplace  transform,  which  includes  an 
extensive  table,  theory,  and  applications,  is  Churchill  (1970). 

Parameter  Identification  Problems 

In  some  physical  problems  all  of  the  input  parameters  may  not  be  known  a 
priori.  In  a  parameter  identification  problem  we  ask  whether  it  is  possible 
to  take  certain  measurements  and  thereby  determine  an  unknown  parameter 
(a  constant)  or  distributed  parameter  (a  function)  in  a  PDE.  For  example, 
suppose  the  diffusivity  of  a  metal  rod  is  unknown.  Will  holding  the  temperature 
constant  at  one  end  and  measuring  the  heat  flux  out  the  other  end  determine 
the  diffusivity?  Parameter  identification  problems  are  a  subclass  of  inverse 
problems;  these  can  be  described  qualitatively  as  problems  where  the  input 
is  to  be  recovered  from  the  output,  rather  than  conversely,  which  is  the  case 
for  so-called  direct  problems.  All  of  the  problems  we  encountered  so  far  are 
direct  problems  (given  all  the  data,  find  the  solution),  but  one  can  argue  that 
inverse  problems  play  an  equally  important  role  in  applied  science.  In  many 
empirical  problems  we  want  to  measure  the  output  of  experiments  and  use 
that  information  to  determine  properties  of  the  system.  In  tomography,  for 
example,  we  can  we  learn  about  density  variations  in  tissue  by  observing  the 
reflection  and  transmission  properties  of  sound  waves.  Here  we  will  not  be  too 
ambitious,  but  rather  only  introduce  some  of  the  issues. 

Example  2.12 

Consider  a  long  (x  >  0)  metal  slab  of  unknown  thermal  conductivity  K  but 
known  density  p  and  specific  heat  c.  For  simplicity,  take  p  =  c  =  1.  Suppose 
further  that  measurements  can  be  made  only  at  the  face  x  =  0.  If  a  known 
temperature  f(t)  at  x  =  0  is  applied,  can  we  measure  the  heat  flux  at  x  =  0 
at  a  single  instant  of  time  to  and  thereby  determine  K1  We  assume  /( 0)  =  0. 
The  PDE  model  is 

ztf  —  KuXx'>  Xj  t  0,  (2.52) 

u(0,  t)  =  /(t),  t  >  0,  (2.53) 

u(x,  0)  =  0,  x  >  0,  (2.54) 

which  is  a  well-posed  problem.  We  are  asking  whether  we  can  determine  K 
from  a  single  flux  measurement 


-Kux(0,tQ)  =  a, 


(2.55) 
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where  a  is  known.  Using  Laplace  transforms  we  have  already  solved  the  direct 
problem.  The  solution  is 


u(x,  t) 


x 


o  y  4iL7r(t  —  r)3 


e-,V(4 K(t-r))  f{T)dT_ 


(2.56) 


It  appears  that  the  strategy  should  be  to  calculate  the  flux  at  (0,  to)  from  the 
solution  formula  (2.56).  Indeed  this  is  the  case,  but  calculating  the  x  derivative 
of  u  is  not  a  simple  matter.  The  straightforward  approach  of  pulling  a  par¬ 
tial  derivative  d/dx  under  the  integral  sign  fails  because  one  of  the  resulting 
improper  integrals  cannot  be  evaluated  at  x  =  0;  it  does  not  exist.  The  reader 
should  verify  this  statement.  Therefore,  we  must  be  more  clever  in  calculating 
ux.  To  this  end  we  note  that  (2.56)  can  be  written  as 

u  d 

u(x,t)  =  —  2K  /  —G(x,t  —  r)f(r)dr, 

Jo 


dx 


where  G(x,t)  is  the  heat  kernel 


G(x,t)  = 


1 


V47T  Kt 


-x2 /{AKt) 


Therefore, 


d 2 

ux(x,t)  =  -2K  J  —G(x,t  -  r)f{r)dr 
2  [  S-G(x,t  —  r)/(r)dr, 


(2.57) 


(2.58) 


'o  dr 

since  —  GT  =  Gt  =  KGXX  (the  heat  kernel  satisfies  the  heat  equation).  Now  we 
integrate  by  parts  to  obtain 


ux(x,t)  =  -2  [  G(x,t-T)f(T)d,T. 

Jo 


The  boundary  terms  generated  by  the  integration  by  parts  are  zero.  Conse¬ 
quently,  we  have 

•to 


■Kux(0,to)  =  \fK  f  — 

Jo  V 


fir) 


■n(to  -  t) 


.dr  =  a. 


This  equation  uniquely  determines  K  and  solves  the  parameter  identification 
problem. 

For  example,  if  f{t)  =  /?£,  i.e.,  the  temperature  is  increased  linearly,  then 
the  integral  can  be  calculated  exactly  to  obtain 
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Sections  throughout  the  text  provide  exercises  showing  additional  examples- 
typically,  inverse  problems,  and  in  particular  distributed  parameter  identifica¬ 
tion  problems,  lead  to  an  integral  equation  for  the  unknown  function.  These 
equations  are  difficult  to  resolve,  and  stability  is  often  a  problem.  That  is,  a 
small  change  in  measurement  can  cause  a  large  change  in  the  calculated  param¬ 
eter  value. 


Table  2.1  Laplace  Transforms 


u(t) 

U(s) 

1 

gdt 

tn,  n  a  positive  integer 

s_1,  s  >  0 

,  s  >  a 

s—a  7 

n!  Q  >  0 

sn+l  1  &  ^  VJ 

sin  at  and  cos  at 

sinh  at  and  cosh  at 

H(t  -  a) 

&  nprl  ^ 

o  |  9  9  |  9  • 

sz-\-az  sz-\-az  7 

2  a  o  and  2  5  2 , 

sz—az  sz—az  7 

s~1e~CLS ,  5  >  0 

5  >  0 

s  >  a 

5{t  —  a) 

H(t  —  a 

)f(t  -  a) 

g  — as 

F(s)e~as 

f(t)e  at 

F(s  +  a) 

H(t  —  a 

)f(t ) 

e~asC(f(t  +  a)) 

erf  y/t 

s_1(l  -I-  s)_1/2, 

s  >  0 

7*  exp  ( 

'-a2) 

V  4t  ) 

Y^7r/ 5  e~a^ ,  (s 

>0) 

1  —  erf 

(ab) 

^-ig-ays,  s  >  0 

2t%2  exp  (  4“  ) 

yCre -ay/s  ^  5  >  0 

EXERCISES 

1.  Consider  the  nonhomogeneous  initial  value  problem 

u"  +  4 u  =  i/(0)  =  0,  u'( 0)  =  0. 

a)  Use  Duhamel’s  principle  to  solve  this  problem.  That  is,  show 


u(t)  =  /  w(t  —  r,  r)dr 

Jo 
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where  (re,  r)  is  the  solution  to 

w"  +  Aw  =  0,  re(0,  r)  =  0,  u/(0,  r)  =  re 

b)  Use  Laplace  transforms  to  verify  your  solution. 

2.  Find  the  inverse  transform  of 

U(s)  =  1 


—  r 


s(s2  +  1) 


using  the  convolution  theorem. 


3.  Using  the  integral  definition  of  Laplace  transform,  find  the  transform  of 
u(t)  =  Vi.  Hint:  Make  the  substitution  r2  =  st. 


4.  Show  that  C  ^ f{r)dr^j  = 


5.  Show  that  C  ( H(t  —  a)  f(t  —  a))  =  e~asF(s),  where  H  is  the  unit  step  func¬ 
tion  (the  Heaviside  function)  defined  by  H(x)  =  0  for  x  <  0,  and  H(x)  =  1 
for  x  >  0. 

6.  Solve  utt  =  uxx  for  x,t  >  0  with  u(0,t)  =  sint,  t  >  0,  and  u(x,  0)  = 
0,  Ut(x,  0)  =  1,  x  >  0. 

7.  Solve  ut  =  uxx  on  x,  t  >  0  with  u( 0,  t)  =  a,  t>  0  and  u(x,  0)  =  6,  where  a 
and  b  are  constants. 

8.  Use  Laplace  transforms  to  solve  the  initial  boundary  value  problem 

'U't  —  ^ xx->  ^  >  0,  t  0, 

ux(0,t)  -  u(0,t)  =  0,  t  >  0, 
u(x,  0)  =  x  >  0. 

Interpret  this  model  physically  in  the  context  of  heat  flow. 


9.  Find  a  bounded  solution  to 

'U'tt 

u(0,t)  = 
u(x ,  0)  = 


C2uxx  -  f(t),  x  >  0,  t  >  0, 
0,  t  >  0, 

Ut(x,  0)  =  0,  x  >  0. 


10.  Solve  the  following  problem  using  Laplace  transforms. 

Utt  =  C2uxx  -  g,  x  >  0,  t  >  0, 
u( 0,  t)  =  0,  t  >  0, 
u(x,  0)  =  Ut(x,  0)  =  0,  x  >  0. 

The  solution  shows  what  happens  to  a  falling  cable  lying  on  a  table  that  is 
suddenly  removed.  Sketch  some  time  snapshots  of  the  solution. 
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11.  In  the  quarter  plane  x,y  >  0,  where  the  temperature  is  initially  zero,  heat 
flows  only  in  the  ^-direction;  along  the  edge  y  =  0  heat  is  convected  along 
the  x-axis,  and  the  temperature  is  constantly  1  at  the  point  x  =  y  =  0. 
The  boundary  value  problem  for  the  temperature  u(x,  y ,  t)  is 

ut  =  Uyy ,  x,t,y  >  0, 
u(x,  y ,  0)  =  0,  x,  y  >  0, 
u{ 0, 0,  t)  =  1,  t  >  0, 

Ut(x,  0,  t)  +  ux(x,  0,  t)  =  0,  x,£>0. 

Find  a  bounded  solution  using  Laplace  transforms. 

12.  A  very  deep  container  of  liquid  is  insulated  on  its  sides.  Initially,  its  temper¬ 
ature  is  a  constant  uq  degrees,  and  for  t  >  0  heat  radiates  from  its  exposed 
top  surface  according  to  Newton’s  law  of  cooling  (see  Section  1.3).  The 
air  temperature  is  zero  degrees,  (a)  Formulate  an  initial  boundary  value 
problem  for  the  temperature  of  the  liquid,  and  find  a  formula  for  the  tem¬ 
perature  at  various  depths  at  various  times,  (b)  Take  p  =  c  =  K  =  f3  =  1, 
and  use  a  computer  algebra  system  to  plot  some  temperature  profiles. 

13.  Derive  the  solution  u(x,t)  =  H{t  —  x/c)g(t  —  x/c)  to  the  problem 

Uff  —  C  Uxxi  x^t  0, 

u(x,  0)  =  Ut(x,  0)  =  0,  x  >  0, 
u( 0,  t)  =  g(t),  t  >  0. 


14.  (Parameter  identification)  Suppose  a  chemical  reactor  occupying  the  space 
x  <  0  operates  at  some  unknown  temperature  F(t).  To  determine  F(t) 
we  insert  a  long,  laterally  insulated  metal  probe  of  unit  diffusivity,  and  we 
measure  the  temperature  U(t)  of  the  probe  at  r  =  1.  Assume  that  the 
probe  occupies  0  <  x  <  oo.  Formulate  the  model  equations  and  show  that 
F(t)  satisfies  the  integral  equation 

F(t) 


U(t)  = 


1 


,e-l/4  (t-T)d 


20^  Jo  (t  -  r)3/2 

Hint:  Solve  the  direct  problem  by  Laplace  transforms  and  use  the  table 
entry 


CT \e~^sx) 


y/sx\  _ 


X 


-cc  /  (4t) 


2V/Tt3/2 

Suppose  the  temperature  of  the  reactor  F(t)  is  a  constant  Fq;  find  Fq  if 
the  temperature  at  x  =  1  is  found  to  be  10  degrees  at  t  =  5. 


15.  Verify  (2.59). 
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The  Fourier  transform  is  another  integral  operator  with  properties  similar  to 
the  Laplace  transform  in  that  derivatives  are  turned  into  multiplication  oper¬ 
ations  in  the  transform  domain.  Thus  the  Fourier  transform,  like  the  Laplace 
transform,  is  useful  as  a  computational  tool  in  solving  differential  equations.  In 
PDEs  the  Laplace  transform  is  usually  applied  to  the  time  variable,  while  the 
Fourier  transform  is  often  applied  to  the  spatial  variable  when  it  varies  over 
(  —  00,  oo). 

We  begin  with  functions  of  one  variable.  The  Fourier  transform  of  a 
function  u  =  u(x),  x  E  R,  is  defined  by  the  equation 

/oo 

u(x)el^xdx.  (2.60) 

-oo 


If  u  is  absolutely  integrable,  i.e.,  \u\dx  <  oo,  then  u  =  u(f)  can  be  shown 
to  exist. 

The  transform  of  a  function  u  =  u(x)  may,  or  may  not,  turn  out  to  be 
a  complex-valued  function.  The  variable  £  is  real,  but  the  value  u(£)  can  be 
complex.  Further,  there  are  a  lot  of  common,  simple  functions  that  do  not  have 
a  classical  Fourier  transform  because  the  improper  integral  does  not  exist.  For 
example, 

u(x)  =  ex  and  u(x)  =  C 


do  not  have  Fourier  transforms.  Only  functions  that  decay  at  Too  sufficiently 
fast  have  Fourier  transforms. 

In  the  theory  of  Fourier  transforms,  it  is  common  to  work  with  the  set  S  of 
rapidly  decreasing  functions  on  R:  these  are  the  functions  with  continuous 
derivatives  of  all  orders,  and  for  which  each  function  and  all  its  derivatives 
decay  to  zero  as  x  —>  Too  faster  than  any  power  function  (functions  like  1/x2 
and  1/x 6).  The  function  exp(— x2)  defining  the  bell-shaped  curve  is  a  such  a 
rapidly  decreasing  function.  More  technically,  if  the  set  of  functions  that  have 
continuous  derivatives  of  all  orders  on  R  is  denoted  by  C°°,  then 


S  =  {ueC°° 


u 


(*0 


(x)\  <  M 


x 


■N 


as  x 


oo. 


k  =  0, 1,  2, . . . ;  for  all  integers  TV}. 


The  set  S  is  called  the  Schwartz  class  of  functions,  and  one  can  show  that 
if  u  G  5,  then  u  G  <S,  and  conversely.  So  S  is  a  closed  set  under  both  Fourier 
transformation  and  inversion,  which  makes  it  a  good  set  to  work  with.  In  this 
text,  however,  a  purely  formal  approach  is  not  taken;  rather,  the  goal  is  to 
understand  how  Fourier  transforms  provide  a  useful  tool  for  problem-solving. 
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There  is  one  important  remark  about  notation.  There  is  no  standard  con¬ 
vention  on  how  to  define  the  Fourier  transform  (2.60);  some  put  a  factor  of 
1  / (27 r)  or  1/ V27T  in  front  of  the  integral,  and  some  have  a  negative  power  in 
the  exponential,  or  even  include  a  factor  of  2n  in  the  exponential.  One  should 
be  aware  of  these  variations  when  consulting  other  sources,  especially  tables  of 
transforms. 

A  basic  property  of  the  Fourier  transform  is  that  the  kth  derivative  u ^ 
(k  =  1,2,...)  transforms  to  an  algebraic  expression.  That  is, 

(Tu^)(0  =  (-iOku(0,  (2.61) 

confirming  our  comment  that  derivatives  are  transformed  to  multiplication  (by 
a  factor  of  This  formula  is  easily  proved  using  integration  by  parts  (as 

for  the  Laplace  transform);  all  the  boundary  terms  generated  in  the  integration 
by  parts  are  zero,  since  u  and  all  its  spatial  derivatives  vanish  at  Too. 

For  functions  of  two  variables,  say  u  =  u(x,t),  the  variable  t  acts  as  a 
parameter,  and  we  define  on  the  variable  x  as 

/oo 

u(x,  t)el^xdx. 

-oo 

Then,  under  Fourier  transformation,  ^-derivatives  turn  into  multiplication,  and 
t  derivatives  remain  unaffected;  for  example, 

(Tux)^,t)  =  (—i£)u(£,  t). 

{Fuxx){£,t)  = 

(Tut)(£,t)  =  ut(£,t). 


Solving  a  differential  equation  for  u  involves  first  transforming  the  problem 
into  the  transform  domain  (u  and  £)  and  then  solving  the  resulting  differential 
equation  for  u.  One  is  then  faced  with  the  inversion  problem,  or  the  problem 
of  determining  the  u  for  which  Tu  =  u.  Another  nice  property  of  the  Fourier 
transform  is  the  simple  form  of  the  inversion  formula,  or  inverse  transform. 
It  is 


(. T  lu){x)  =  u(x) 


1 

2?r 


uiOe-^dC 


(2.62) 


This  result  is  called  the  Fourier  integral  theorem;  it  dictates  how  to  get 
back  from  the  transform  domain. 

Some  Fourier  transforms  can  be  calculated  directly,  but  many  require  com¬ 
plex  contour  integration.  In  the  next  example  we  calculate  the  transform  using 
a  differential  equation  technique. 
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Example  2.13 

2 

We  calculate  the  transform  of  the  Gaussian  function  u(x)  =  e~ax  ,  a  >  0,  or 
the  classical  bell-shaped  curve.  By  definition, 


e~ax2e^xdx 


Differentiating  with  respect  to  £  and  then  integrating  by  parts  gives 


*oo 


u'(0  = 


xe~ax2elixdx 


-OO 


—  l 


•OO 


d 


2 a  7_00  dx 


e~ax2e^xdx 


=  — —u(£). 

2  a 

Therefore,  we  have  a  differential  equation  u'  —  for  u.  Separating  variables 
and  integrating  gives  the  general  solution 


u 


(0  =  Ce"^/(4a) 


The  constant  C  can  be  determined  by  noticing  that 

/OO 

e~ax  dx  = 

-oo 

Consequently,  we  have 

JF{e~axl)  =  We-?2/(4o).  (2.63) 

V  CL 

So,  the  Fourier  transform  of  a  Gaussian  function  is  itself  a  Gaussian;  likewise, 
the  inverse  transform  of  a  Gaussian  is  a  Gaussian.  Equation  (2.63)  has  an 
important  interpretation.  Let  us  regard  u[x)  =  e  ax  ,  a  >  0  as  a  signal 
representing  information.  If  a  is  large,  then  the  signal,  i.e.,  the  bell-shaped 
curve,  has  a  very  high, narrow  peak;  information  in  the  signal  is  localized  and 
coherent.  In  the  transform  domain,  however,  the  signal  u(£)  is  broad  with  a 
low  peak.  The  information  has  spread  and  coherence  is  lost.  The  opposite  is 
true  when  a  is  small:  broad  signals  transform  to  narrow  ones.  □ 

Remark.  We  can  think  of  a  Fourier  transform  as  resolving  a  function  into  all 
of  its  frequencies.  A  measurement  of  the  frequency  spectrum  is  the  modulus 

HO  I-  □ 
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Example  2.14 

If  u(x)  =  H(x  +  1)  —  H(x  —  1)  (a  square  pulse),  then  it  is  easily  shown  that 
u(£)  =  2  .  The  frequency  spectrum  is 


^  i  2  sin  | 

Cl 

e 

| 

The  plot  of  the  frequency  spectrum  shows  the  frequency  composition  of  the 
square  wave  pulse.  □ 

Similar  to  the  case  of  Laplace  transforms,  a  convolution  relation  holds  for 
Fourier  transforms.  If  u,  v  G  <S,  then  we  define  their  convolution,  which  is  in 
s,  by 

/OO 

u(x  —  y)v(y)dy. 

-oo 

Then  we  have  the  following  theorem. 

Theorem  2.15 

(Convolution  theorem)  If  n,  v  G  S.  then 

By  the  Fourier  integral  theorem  it  follows  immediately  that 

(u*v)(x)  =  F~1(u(t)v(£)). 

This  formula  states  that  the  inverse  transform  of  a  product  of  Fourier  trans¬ 
forms  is  a  convolution;  this  is  a  useful  relationship  in  solving  differential  equa¬ 
tions. 


Example  2.16 

Let  /  G  S  and  determine  u  for  which 

u"  —  u  =  /(#),  xGl. 
Taking  the  transform  of  both  sides  yields 

{-iifu  —  u  =  f , 


or 


1 


/(0- 


i+e 
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In  the  transform  domain  the  solution  is  a  product  of  transforms,  and  so  we 
apply  the  convolution  theorem.  From  the  Exercises  we  have 


Therefore, 


1 

T+e' 


\  [  e  lx  v]f(y)dy.  □ 

^  J  —  OO 


The  strategy  in  applying  transform  methods  to  solve  partial  differential 
equations  is  to  proceed  formally,  making  any  assumptions  that  are  required 
to  obtain  an  answer;  for  example,  assume  that  all  the  data  is  in  S.  When  a 
solution  is  obtained  one  can  then  attempt  to  verify  that  it  is  indeed  a  solution 
to  the  problem.  Often  one  can  prove  that  the  solution  obtained  holds  under  less 
severe  conditions  than  required  in  the  application  of  the  transform  method. 

Now  we  apply  the  Fourier  transform  method  to  the  Cauchy  problem  for  the 
heat  equation.  We  will  derive  the  same  solution  formula  (2.8)  that  we  obtained 
in  Section  2.1  by  a  different  method. 


Example  2.17 

(Heat  equation)  Use  Fourier  transforms  to  solve  the  pure  initial  value  problem 
for  the  heat  equation: 

ut  —  kuxx  =  0,  xG,t>0;  u(x,  0)  =  /(#),  xGi  (2.64) 

Again  we  assume  that  /  £  S.  Taking  Fourier  transforms  of  the  PDE  gives 

ut  =  — £2/ch, 

which  is  an  ordinary  differential  equation  in  t  for  &(£,  £),  with  £  as  a  parameter. 
Its  solution  is 


&(£,  t)  =  Ce  ^  kt. 

But  the  initial  condition  gives  u(£,  0)  =  /(£),  and  so  C  =  /(£).  Therefore, 

u{£,t)  =  e_$2fct/(0- 


Replacing  a  by  1/(4 kt)  in  formula  (2.63)  gives 

T  (  1  p-x2/(4kt)\  =  e-e 

\  V47 xkt  J 

Thus,  by  the  convolution  theorem  we  have 

1 


kt 


*oo 


u(x,  t ) 


■oo 


\/  Ankt 


(x-y)  /(4 kt) 


f(y)dy. 


(2.65) 
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This  solution  was  derived  under  the  assumption  that  /  is  in  the  Schwartz  class. 
But  now  that  we  have  it,  we  can  attempt  to  show  that  it  is  a  solution  under 
milder  restrictions  on  /.  For  example,  one  can  prove  that  (2.65)  is  a  solution 
to  (2.64)  if  /  is  a  continuous,  bounded  function  on  R.  □ 


Example  2.18 

(Laplace’s  equation  in  a  half  plane)  Now  we  solve  a  basic  problem  involving 
Laplace’s  equation  in  the  upper  half  plane.  Consider 


Uxx  +  uyy=  0,  X  e  ,  y  >  0;  u(x,0)  =  f(x),  x  e  R. 

We  also  append  the  condition  that  the  solution  u  remains  bounded  as  y  oo. 
This  example  is  similar  to  the  last  example.  Taking  the  transform  (on  x  with 
y  as  a  parameter)  of  the  PDE,  we  obtain 

Uyy  ~  £* 1 2U  =  0, 

which  has  general  solution 

«(£,  y)  =  a(£)e~Sy  +  b(£)eiy. 

The  boundedness  condition  on  u  forces  b(£)  =  0  if  £  >  0  and  a(£)  =  0  if  £  <  0. 
So  we  take 

«(£,y)  =  c(0e~^v- 

Upon  applying  Fourier  transforms  to  the  boundary  condition,  we  get  c(£)  = 

A 

/(£).  Therefore,  the  solution  in  the  transform  domain  is 

HZ,y)  =  e_l?|j7(0- 


Then,  using  the  convolution  theorem,  we  obtain  the  solution 


y 


u{x,y)  =  - 


1 


7T  xz  +  y* 


*/  =  - 


•  oo 


f(r)d/r 


7T  J_00  ( x  -  t)2  +  y2  ' 


□ 


An  excellent,  brief  introduction  to  Fourier  transforms  can  be  found  in  Vret- 
blad  (2006). 


EXERCISES 

1.  Find  the  convolution  of  the  functions  f(pc)  —  x  and  g(pc)  =  e 

2.  Show  that  the  inverse  Fourier  transform  of  e-a^,  a  >  0,  is 

a  1 


7T  x2  +  a2 
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3. 


4. 

5. 

6. 

7. 

8. 

9. 

10. 


11. 


12. 


Verify  the  following  properties  of  the  Fourier  transform: 

a)  (Tu)(0=2ir(T~1u)(-0. 

b)  J7 (eiax u) +  a). 

c)  F(u(x  +  a))  =  e~ta^u(^). 

Formula  (a)  states  that  if  a  transform  is  known,  so  is  its  inverse,  and  con¬ 
versely. 

If  T  Ore-M)  =  T^,  find  T  ■ 

If  u(x)  =  e~'x',  compute  u  *  u  and  then  find  the  inverse  Fourier  transform 

of _ I _ 

Ol  (1+£2)2- 

Find  the  Fourier  transform  of  the  function  u  defined  by  u(x)  =  e~ax  if 
x  >  0,  and  u(x)  =  0  if  x  <  0.  Plot  the  frequency  spectrum. 

Compute  T[xe~axl\  Hint:  Use  (2.61). 


Compute  the  Fourier  transform  of  u{pc) 
0  if  |  x  |>  7r/2 a. 


cos  ax  if  |  x  \<  it /2a,  and  u(x)  = 


Solve  the  following  initial  value  problem  for  the  inhomogeneous  heat  equa¬ 
tion: 

Ut  =  uxx  +  F{x,  £),  x  G  ,  t  >  0  u(x,  0)  =  0,  xGl. 


Find  a  formula  for  the  solution  to  the  following  initial  value  problem  for 
the  free  Schrodinger  equation: 

ut  =  iuxx ,  x  G  ,  t  >  0;  u{pc,  0)  =  e~x  ,  x  G  R. 


Find  a  bounded  solution  to  the  Neumann  problem 

^ XX  H-  Uyy  —  0,  X  G  ,  y  >  0, 

0)  =  g(x),  xGi 

Hint:  Let  v  =  uy  and  reduce  the  problem  to  a  Dirichlet  problem.  The 
solution  is 

u(x,  y)=2^J  9{x-  £)  In (y2  +  $.2)d£  +  C. 

Solve  the  boundary  value  problem 


^ xx  H-  'Uyy  —  0?  x  G  M,  y  0, 


u(x,  0)  =  1,  \x\  <  /;  u(x,  0)  =  0,  \x\  >  l 
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13.  Use  integration  by  parts  to  verify  (assume  u  E  S) 

(Tux)(£,t)  =  (-i£)u(£,t), 
(Fuxx)(£,t)  =  (-i£)2u(£,t). 


14.  Let  u(x)  be  a  square  wave,  i.e,  u(x) 
Show  that 

Plot  the  frequency  spectrum. 


1  if  \x\  <  a  and  u(x)  =  0  if  \x\  >  a 


2  sin  a£ 


15.  Solve  the  Cauchy  problem  for  the  advection-diffusion  equation  using 
Fourier  transforms: 


ut  =  Duxx  —  cux ,  xGl,  t  >  0;  u(x,  0)  =  (f)(x),  x  E  R. 


16.  This  exercise  explores  the  role  of  a  term  uxxx  (called  a  dispersion  term) 
in  a  PDE  by  examining  the  equation 


'U't  T  ^ xxx  —  0* 


This  equation  is  sometimes  called  the  linearized  Korteweg-de Vries  (KdV) 
equation. 


a)  What  relation  between  uj  and  k  would  have  to  hold  if  a  solution  of  the 
form 

u{x,  t)  =  ei(kx~ut) 

exists?  What  do  these  solutions  look  like,  and  how  does  their  speed 
depend  on  kl  What  does  your  conclusion  mean,  qualitatively?  (Recall 
that  the  real  and  imaginary  parts  of  a  complex  solution  to  a  linear 
equation  are  both  real  solutions.) 


Use  Fourier  transforms  to  solve  the  Cauchy  problem  for  the  linearized 
KdV  equation,  and  write  your  answer  in  terms  of  the  Airy  function 
defined  by 


Ai(x)  = 


1 

27T 


dz. 


IT.  Take  a  Gaussian  function  exp(— x2)  as  the  initial  condition  for  the  Cauchy 
problem  for  the  heat  equation  and  find  the  solution  profile  when  t  =  1. 

18.  Use  Fourier  transforms  to  derive  d’Alembert’s  solution  (2.14)  to  the  Cauchy 
problem  for  the  wave  equations  (2.12)-(2.13)  when  the  initial  velocity  is 
zero,  i.e.,  g{pc)  =  0. 
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19.  (Parameter  identification)  It  is  believed  that  nerve  impulses  are  transmit¬ 
ted  along  axons  by  both  diffusion  and  convection.  In  fundamental  experi¬ 
ments  the  biologists  Hodgkin  and  Keyes  used  radioactive  potassium  42  K  to 
measure  how  ions  convect  and  diffuse  in  squid  axons.  If  u  =  u(x,  t)  is  the 
concentration  of  potassium  in  a  long  axon,  then  the  convection-diffusion 
model  is 

Ut  =  Duxx  —  vux ,  xgR,  t  >  0. 

The  velocity  v  can  be  measured  directly,  but  the  diffusion  constant  D  is 
difficult  to  measure.  To  determine  D  take  a  known  initial  concentration 
u(x,  0)  =  e~x  !a  and  solve  for  the  concentration  Obtain 


u(x,  t ) 


\/  a  +  4  Dt 


-O 


vt)2  /  (a+4Dt) 


Show  how  D  can  be  recovered  from  a  continuous  measurement  U (t)  = 
u(xo,t)  of  the  potassium  concentration  at  some  fixed  location  x  =  Xq. 

20.  Solve  the  integral  equation  for  /: 


y)e  v  dy 


3 

Orthogonal  Expansions 


In  this  chapter  we  study  topics  that  form  the  traditional  core  of  applied 
mathematics — boundary  value  problems  and  orthogonal  expansions.  These  sub¬ 
jects  are  interrelated  in  one  of  the  most  aesthetic  theories  in  all  of  mathematics. 

In  an  abstract  sense,  mathematics  can  be  thought  of  as  imposing  structures 
on  sets  of  objects  and  studying  their  properties.  In  linear  algebra,  for  exam¬ 
ple,  an  algebraic  structure  (addition  and  scalar  multiplication)  is  defined  on  a 
finite-dimensional  vector  space,  typically  Mn,  along  with  geometric  structures 
that  characterize  concepts  like  distance  and  orthogonality  (perpendicularity). 
Finally,  linear  transformations,  represented  by  matrices,  between  vector  spaces 
can  be  introduced  and  we  can  study  how  those  interrelate  with  the  algebraic 
and  geometric  structures.  This  all  leads  to  the  solution  of  linear  systems,  the 
algebraic  eigenvalue  problem,  and  the  decomposition  of  the  vector  space  and 
the  transformation  into  fundamental,  or  canonical,  forms.  What  is  exciting  is 
that  all  of  these  ideas  extend  in  a  straightforward  way  to  infinite-dimensional 
function  spaces  and  linear  differential  operators  defined  boundary  value  prob¬ 
lems  on  those  spaces.  The  theory  is  not  only  insightful  and  unifying,  but  it  has 
far-reaching  applications  to  science,  engineering,  and  mathematics. 


3.1  The  Fourier  Method 

Without  a  lot  of  detail,  we  begin  with  an  example  that  indicates  how  these 
ideas  apply  to  initial  boundary  value  problems  in  PDEs.  The  key  question,  as 
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we  observe,  is  whether  a  function  f{x)  can  be  expanded  in  a  linear  combinations 
of  ‘basis’  functions,  analogous  to  expanding  a  vector  in  terms  of  orthogonal  basis 
vectors.  Next  we  see  why  this  question  arises. 

The  basic  technique  for  solving  PDEs  on  a  bounded  spatial  domain  is  the 
Fourier  method,  named  after  Joseph  Fourier  (1768-1830).  In  this  first  section 
we  take  a  nineteenth-century  perspective  and  make  some  comments  about  the 
origin  of  the  method.  Our  discussion  will  motivate  one  of  the  most  fundamental 
topics  in  analysis  and  in  PDEs,  namely  orthogonal  expansions. 

To  fix  the  notion  let  us  consider  heat  flow  in  a  finite  bar  of  length  l  =  n  and 
unit  diffusivity  (k  =  1),  where  the  ends  are  held  at  zero  degrees.  We  choose  i r 
as  the  length  to  make  the  constants  come  out  simpler  at  the  end.  From  Chapter 
1  we  know  that  the  temperature  u  =  u(x,t)  in  a  laterally  insulated  bar  must 
satisfy  the  model  equations 


Ut  =  uxx,  0  <  x  <  7r;  t  >  0,  (3.1) 

«(0,  t)  =  u(tt,  t)  =  0,  t  >  0,  (3-2) 

consisting  of  the  heat  equation  and  the  boundary  conditions.  For  the  moment, 
we  ignore  the  initial  condition.  It  is  easy  to  check  that  for  any  positive  integer 
n,  a  solution  to  (3.1)  is  given  by 

un(x,t )  =  e~n  t  sin nx.  (3-3) 

Each  of  these  solutions  clearly  satisfies  the  boundary  conditions  (3.2)  as  well. 
For  the  moment,  put  a  hold  on  your  curiosity  about  the  origin  of  the  set  of 
solutions  in  (3.3). 

As  we  know,  for  a  well-posed  problem,  equations  (3.1)  and  (3.2)  are  aug¬ 
mented  by  an  initial  condition  of  the  form 

u(x,  0)  =  /(x),  0  <  X  <  7T.  (3.4) 

We  now  ask  how  a  solution  to  the  initial  BVP  can  be  constructed  from  only 
the  solutions  (3.3)  of  the  boundary  value  problem?  Fourier  argued  that  the 
solution  u(x,t)  to  (3.1),  (3.2),  (3.4)  can  be  taken  as  a  linear  combination  of  the 
infinite  number  of  solutions  un(x,t)  in  (3.3);  that  is, 

oo 

E2  i 

ane~n  sinnx  (3.5) 

n= 1 

for  appropriately  chosen  constants  an.  To  satisfy  the  initial  condition,  (3.4) 
implies  that 

oo 

u(x,  0)  =  f(x)  =  an  sin  nx. 

n—  1 


(3.6) 
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This  means  the  initial  temperature  function  f(x)  must  be  an  infinite  series 
representation  in  terms  of  the  periodic  ‘basis’  functions  sin  ay  sin2oy  sin3oy  . . .. 

On  a  historical  note,  the  calculation  was  troublesome  in  Fourier’s  time,  in 
the  early  1800s.  Concepts  like  convergence  of  series  were  not  well  understood. 
L.  Euler,  D.  Bernoulli,  and  d’Alembert,  all  of  whom  had  addressed  similar 
problems  in  the  mid- 1700s  regarding  the  wave  equation  and  vibrating  strings, 
had  wondered  about  the  possibility  of  expanding  a  nonperiodic  function  f(x) 
in  terms  of  periodic  functions  like  sines  and  cosines.  It  was  L.  Dirichlet  who 
in  1829  finally  established  conditions  under  which  such  series  representations 
like  (3.6)  are  valid.  In  fact,  these  ideas  about  convergence  of  Fourier  series  gave 
tremendous  impetus  to  the  development  of  mathematical  analysis  in  the  19th 
century. 

To  determine  the  coefficients  an  that  make  (3.6)  valid,  one  could  proceed 
formally  and  multiply  both  sides  of  (3.6)  by  sin  may  for  some  fixed  but  arbitrary 
index  m,  and  then  integrate  from  x  =  0  to  x  =  tt  to  obtain 


PIT  f'  7T  00 

/  f(x)smmxdx  =  /  an  sin  nx  sin  rnx  dx. 

Jo  Jo  n=1 

If  we  interchange  the  integral  and  the  summation  sign  we  obtain 

p7T  °°  n  7T 

/  f(x)  sin  mx  dx  =  an  /  sin  nx  sin  mx  dx. 

Jo  n=1  Jo 


The  integrals  in  the  right  side  summand  can  be  calculated.  We  find  that 

7 T 

sin  nx  sin  mx  dx  =  0,  n  ^  m.  (3-7) 

Amazingly,  this  fact  forces  the  infinite  series  to  collapse  to  a  single  term.  So 
the  only  term  that  survives  in  the  infinite  series  is  the  one  where  the  dummy 
summation  index  n  hits  m,  the  fixed  index.  Therefore  we  get 

/* 7T 

/  f  (x)  sin  rnx  dx  =  arn 

Jo 

Because  m  is  an  arbitrary  fixed  index,  we  can  replace  it  by  n  and  so  we  have 
shown  that  the  coefficients  in  (3.6)  are  given  by 


7T 


sin  rnx  dx  =  —a 


m 


2  r 

an  =  —  f(x)sinnxdx,  n  =  1,2,3,....  (3-8) 

71  Jo 

Therefore,  in  summary,  in  a  formal  sense6  the  solution  to  the  heat  flow 
problem  (3.1)  (3.2)~(3.4)  is  given  by  (3.5),  where  the  coefficients  an  are  given 

6  A  formal  calculation  in  mathematics  is  one  done  without  complete  rigor,  but  can 
be  verified  under  special  assumptions. 
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by  (3.8).  Indeed,  one  can  verify  that  this  is  a  solution  for  reasonable  initial 
temperature  distributions  f(x). 

What  we  described  is  Fourier’s  method;  namely,  we  solve  a  general  ini¬ 
tial  boundary  value  problem  by  superimposing  a  set  of  solutions  to  the  pure 
boundary  value  problem  and  then  choosing  the  constants  such  that  the  initial 
condition  is  satisfied  as  well.  The  key  to  the  calculation  is  that  the  functions 
sin nx  satisfy  a  relation  like  (3.7),  which  is  called  an  orthogonality  condi¬ 
tion.  It  is  analogous  to  the  expansion  of  a  vector  in  Mn  in  terms  of  orthogonal 
vectors,  or  a  basis.  Orthogonality  enables  easy  calculation  of  coefficients  in  the 
infinite  series  (3.6).  The  infinite  series  (3.6)  for  f(pc)  in  terms  of  the  orthogonal 
functions  sin  nx  is  called  an  orthogonal  expansion.  We  have  glossed  over 
several  key  steps,  but  in  the  next  section  we  state  the  definitions  and  concepts 
in  much  greater  detail  and  show  their  validity. 

Fourier’s  method  is  the  basic  technique  for  solving  boundary  value  problems 
for  PDEs  on  bounded  domains.  One  obvious,  remaining,  question  is  how  we 
obtained  the  set  of  solutions 

un{x,  t)  =  e~n  sin  nay  n  =  0,1,2,... 

that  forms  a  basis  for  the  expansion.  The  answer  is:  from  a  differential  eigen¬ 
value  problem.  We  fully  discuss  this  eigenvalue  problem  in  Chapter  4.  But  for 
the  present,  we  focus  attention  on  the  representation  of  functions  by  infinite 
series  like  the  orthogonal  expansion  (3.6). 


EXERCISES 

1.  Consider  the  initial  boundary  value  problem  for  the  wave  equation 

Utt  =  C2uxx,  0  <x  <1 r,  t  >  0,  (3.9) 

i/(0,  t)  =  u(n,  t)  =  0,  t  >  0,  (3.10) 

u(x,  0)  =  /(#),  ut(x,  0)  =0,  0  <  x  <  7r,  (3T1) 

on  a  bounded  spatial  domain.  Recall  that  this  system  models  the  small, 
transverse  deflections  of  a  taut  string  fixed  at  the  two  endpoints. 

a)  Verify  that  the  set  of  functions, 

un(x,  t)  =  cos  net  sinnx,  n  =  1,2,..., 

satisfies  the  PDE  (3.9)  and  boundary  conditions  (3.10). 

b)  These  solutions  are  called  fundamental  modes  and  they  represent 
standing  waves.  Plot  several  time  profiles  of  the  fundamental  mode 
corresponding  to  n  =  2  and  c—  1. 
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c)  Superimpose  the  fundamental  modes  by  defining 

u(x,  t )  =  an  cos  net  sin  nx 

n 

and  determine  the  coefficients  an  for  which  u(x,  t )  satisfies  the  initial 
conditions  (3.11). 

d)  Show  formally  that  u(x,t)  solves  the  initial  BVP  (3.9)-(3.11).  (You 
may  differentiate  term-by-term.)  These  calculations  amount  to  resolv¬ 
ing  an  arbitrary  wave  into  its  fundamental  modes. 

e)  Find  a  formal  solution  as  above  if  the  initial  conditions  are  changed  to 

u(x,  0)  =  0,  Ut(x,  0)  =  g(x),  0  <  x  <  7 r. 
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From  elementary  calculus  the  reader  is  familiar  with  the  notion  of  expanding  a 
given  function  f(x)  in  an  infinite  series  of  simpler  functions.  For  example,  if  / 
has  infinitely  many  derivatives  at  a  point  x  =  a,  then  its  Taylor  series  about 
x  =  a  is 


oo 


f(x)  =  ^2cn(x-a)n, 


n— 0 


where  the  cn,  the  Taylor  coefficients,  are  given  by 

cn  =  f{n\a)/n\  . 

For  example,  the  Taylor  series  for  the  exponential  function  about  x  =  a  =  0  is 


,cc 


1  9  1  O 

1  +  x  H — -x  H — -x  + 

2!  3! 


Another  type  of  series  is  a  Fourier  series.  If  /  is  defined  and  integrable  on 
the  interval  [ — tt,  tt]  ,  then  its  Fourier  series  is  an  infinite  series  of  the  form 


ao 


oo 


+  2_.  ian  cos  nx  +  bn  sin  nx) , 


(3.12) 


n— 1 


where  the  coefficients  are  calculated  by  the  formulas 


1 

an  — 


7 r 


/TT 

f  (x)  cos  nxdx,  n  =  0,1,2 

-TT 


—  TT 
‘TT 


bn  — 


i  n 

—  /  f(x)  sinnxdx,  n  =  1,2 
^  J  —  TT 
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For  Taylor  series  we  use  the  set  of  polynomial  functions  1,  x  —  a,  (x  —  a)2,  . . . 
as  the  basis  for  the  expansion,  and  for  Fourier  series  we  use  the  trigonometric 
functions  1,  cosay  cos  2x,  . . .,  sin  ay  sin  2ay  ...  as  the  basis  for  the  expansion. 
Each  expansion  has  its  advantages.  From  calculus  we  know  that  a  few  terms 
of  the  Taylor  series  can  provide  a  good  approximation  to  /  locally  near  x  =  a. 
On  the  other  hand,  the  first  few  terms  of  a  Fourier  series  can  provide  a  good 
global  approximation  to  periodic  signals. 


Example  3.1 


We  illustrate  how  to  calculate  the  Fourier  series  (3.12)  of  the  step  function 

f(x)  =  0,  —7 T  <  X  <  0,  f(x)  =  1,  0  <  X  <  7T 
on  the  interval  [ — tt,  tt]  .  The  coefficients,  given  above,  are 


U-{)  — 


1 

7 T 


/7T  l  /*  TT 

f(x)dx  =  —  ldx  =  1, 

-7T  TT  J  Q 


■7 r 
*7 r 


a 


n 


i  r  i  r 

—  /  f  (x)  cos  nx  dx  =  —  /  cos  nxdx 

TT  «/  — 7r  TT  J  q 


1 


sin  no: 


U7T 


7T 

0 


o, 


and 


'n 


i  r  i  r 

—  /  f(ad  sin  no;  da;  =  —  /  sin  no;  da; 
tt  J —  7r  tt  70 


1 


cos  no; 


7T7T 


7T 
0  7 


= - (cos  T17T  —  1)  =  —  (l  —  (— l)n+1)  . 

Tin  Tin  v  7 

Here  we  used  the  fact  that  cosn7r  =  (— l)n.  Thus,  the  Fourier  series  (3.12)  is 

f(x)  =  — | - (  sin  x  H —  sin  3a;  H —  sin  5a;  H —  sin  7a;  +  •  • 

w  2  tt\  3  5  7 

A  plot  is  of  the  first  five  terms  is  shown  in  Figure  3.1.  Note  that  the  value  of 
the  Fourier  series  at  the  discontinuity  x  =  0  seems  to  be  1/2,  the  midpoint 
value  at  the  jump.  □ 


In  this  section  we  want  to  understand  the  underlying  idea  of  a  Fourier 
expansion  by  allowing  the  basis  functions  to  be  any  set  of  functions  with  an 
orthogonality  property.  The  result  will  be  a  general  theory  that  incorporates 
Fourier  series  as  a  special  case.  Orthogonal  is  another  word  for  perpendicular, 
but  it  has  meaning  in  more  general  settings,  such  as  with  respect  to  sets  of 
functions. 
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Figure  3.1  A  five-term  Fourier  ‘wavy’  series  approximation  of  the  function 
f(x)  =  0,  —7 r  <  x  <  0,  f(x)  =  1,  0  <  x  <  7r.  More  terms  give  a  better 
approximation  in  the  open  intervals  (— 7r,  0)  and  (0,7r) 


To  fix  the  context,  let  us  define  a  space  of  functions  L2  =  L2[a,  b]  consisting 
of  all  the  real- valued  functions  /  defined  on  the  interval  [a,  b]  for  which 


dx  <  oo. 


When  this  property  holds,  we  say  that  /  is  square  integrable  and  write  /  E 
L2[a,  b\.  In  this  set  of  functions  we  define  a  inner  product  (or  scalar  product), 
which  is  analogous  to  the  dot  product  for  vectors.  If  /,  g  E  L2,  then  the  inner 
product  of  /  and  g  is  a  real  number,  denoted  by  (/,  g),  and  is  defined  by 


(/.s) 


f(x)g(x)dx. 


By  the  properties  of  integrals,  the  inner  product  of  functions  satisfies  the 
same  properties  of  dot  products  for  vectors,  namely,  symmetry ,  additivity ,  and 

7  The  integral  here  is,  appropriately,  the  Lebesgue  integral  studied  in  advanced  anal¬ 
ysis  courses  rather  than  the  Riemann  integral,  which  is  studied  in  elementary  cal¬ 
culus;  however,  there  will  be  no  harm  in  interpreting  the  integrals  in  this  text 
as  a  Riemann  integrals  since  all  of  the  functions  that  we  examine  are  Riemann 
integrable  (and  therefore  Lebesgue  integrable). 
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homogeneity. 


(. f,9 )  =  ( g,f ), 

(f,9  +  h)  =  ( f,g )  +  (f,h), 

(f,ag)  =  a(f,g),  «£l. 

We  also  introduce  the  notion  of  length,  or  size,  in  the  set  L2[a,  b\.  If  /  G  L2[a,  b], 
then  we  define  the  norm  of  /  to  be  the  nonnegative  real  number 


f\\  =  VUJ) 


Observe  that  this  definition  is  exactly  analogous  to  the  definition  of  the  length 
of  a  vector  v  in  Euclidean  space  Mn:  ||v||  =  •  v.  Introducing  an  inner  prod¬ 

uct  puts  a  geometric  structure  on  L2  that  allows  us  to  define  lengths  and 
perpendicularity,  just  as  in  Mn. 

The  set  L2  is  an  example  of  a  function  space  (a  vector  space  of  functions) 
analogous  to  Mn,  which  is  simply  called  a  vector  space.  One  can  add  functions 
in  L2  in  the  usual  way  and  multiply  functions  in  L2  by  scalars  (real  numbers). 
Those  operations  satisfy  the  same  rules  as  addition  of  vectors  in  Mn  and  mul¬ 
tiplication  of  vectors  by  scalars.  Thus  L 2  and  Mn  are  similar  in  their  algebraic 
structures  and  their  geometric  structures.  However,  a  big  difference  is  that  Mn 
is  finite  dimensional,  whereas  the  function  space  L 2  is  infinite  dimensional.  This 
means  there  is  not  a  finite  number  of  functions  that  form  a  basis  for  L2 . 

By  the  way,  for  notational  convenience  we  often  drop  the  explicit  depen¬ 
dence  of  functions  on  x  and  write  just  /  in  place  of  f(x).  In  some  instances, 
when  /  represents  a  signal,  we  think  of  x  as  time. 


Remark.  It  should  be  clear  that  other  important  spaces  of  functions  on  the 
closed  interval  [a,  b],  e.g.,  continuous  functions,  piecewise  continuous  functions, 
differentiable  functions,  and  so  forth,  are  subsets  of  L 2.  So  the  set  of  functions 
L 2  contains  all  the  functions  we  normally  work  with  in  engineering  and  science. 


Example  3.2 

Let  f(x)  =  x 3  and  g(x)  =  \  +  x  on  the  interval  [0, 1].  These  functions  are  clearly 
in  L2 .  The  inner  product  is 


(Lg) 


x3  (1  +  x)dx 


9 

20' 


The  norm  /  is  measured  by 
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The  norm  is  the  square  root  of  the  area  under  the  graph  of  the  square  of  the 
given  function.  □ 


We  say  that  a  set  of  functions  /i,  /2,  /3, . . .  in  L2[a,  b]  is  orthogonal  on 
the  interval  [a,  b ]  if  (/;,  fj )  =  0  for  i  ^  j  and  ( fj,fj )  =  ||/j||2  0. 


Example  3.3 


The  set  1,  cosay  cos  2ay  cos  3ay  . . sin  ay  sin  2 ay  sm3ay  ...  is  orthogonal  on 
— 7T,  7r]  because  if  we  integrate  the  product  of  any  two  of  these  over  the  interval 
— 7T,  7r]  we  get  zero.  The  reader  should  verify  this  statement  using  formulas 
from  a  calculus  book. 

For  another  example,  we  observed  in  the  last  section  that 

p 7T 

(sin  may  sin  nx )  =  /  sin  mx  sin  nx  dx  =  0,  n  /  m. 

Jo 

Thus  the  set  of  functions  sin  ay  sin  2 ay  ...  is  orthogonal  on  the  interval  [0,7r  . 
The  reader  should  check  that  the  power  functions  l,ay  a;2, . . .  that  appear  in  a 
Taylor  series  do  not  form  an  orthogonal  set  on  any  interval  [a,  b].  □ 


Finally,  we  remark  that  functions  can  be  normalized  in  the  same  way  that 
a  vector  can  be  turned  into  a  unit  vector,  namely  by  dividing  by  its  norm,  or 
size.  if/  G  T2[a,  6],  then  the  function  g  defined  by 


/  0) 


has  unit  norm.  If  each  function  in  an  orthogonal  set  of  functions  has  unit  norm, 
then  the  set  is  called  orthonormal  system.  Thus,  orthogonal  systems  functions 
can  always  be  turned  into  orthonormal  systems. 


Example  3.4 


The  set  of  functions 


nnx 


sin 


l 


n  —  1,2, 


is  orthogonal  on  the  interval  [0,/],  and 
tions 


sin 


mrx 


1/2.  Therefore,  the  func- 


2  U7TX 

1 sm  T 


form  an  orthonormal  set  of  functions  on  [0,  /].  An  expansion  of  a  function  f(x) 
defined  on  [0,  l]  in  terms  of  this  set  of  orthonormal  functions  is  called  a  Fourier 
sine  series.  □ 
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Convergence 

The  next  question  is  a  key  one.  Can  an  arbitrary  function  /  E  L2 
be  expanded  in  an  infinite  series  of  a  given  orthogonal  system  of  functions 
/i,  /2,  /3, . . .?  That  is,  can  we  write 

oo 

f(x)  =  ^2cnfn(x)  (3.13) 

n—  1 

for  some  set  of  constants  cn,  and  in  what  sense  can  we  interpret  the  convergence 
of  the  series  to  f(x)l  Physically,  we  think  of  (3.13)  as  decomposing  a  signal  / 
into  fundamental  modes  /i,  /2,  /3, . . ..  This  is  the  same  question  that  we  asked 
in  Section  3.1  regarding  the  feasibility  of  expanding  a  given  initial  temperature 
f(x)  into  a  sine  series. 

This  question  of  convergence  is  of  course  not  an  issue  in  finite  dimensional 
spaces.  But  in  function  spaces  this  is  a  delicate  question  and  there  are  sev¬ 
eral  possible  answers,  depending  on  how  we  measure  error  in  a  partial  sum. 
Precisely,  we  formulate  three  types  the  errors  that  can  occur  in  an  TV-term 
approximation 

N 

SN(x)  =  ^  Cnfn(x) 

n—1 

to  a  function  f(x).  Each  of  these  error  expressions  leads  to  a  different  definition 
of  convergence  of  the  infinite  series. 

1.  The  pointwise  error  Ejy(x)  is  the  function 


N 

EN{x)  =  f(x)  -  SN(x)  =  f(x)  -  ^2  Cnfn(x). 

n—1 


Thus,  for  a  fixed  number  of  terms  TV,  Ejy(x)  is  a  function  whose  val¬ 
ues  give  the  error  at  each  point  x.  If  for  each  fixed  x  in  [a,  b }  we  have 
liniTv^oo  En(x)  =  0,  then  we  say  that  the  infinite  series  (3.13)  converges 
pointwise  to  /  on  the  interval  [a,  b\;  in  this  case,  for  each  fixed  x  the  series 
is  a  numerical  series  that  converges  to  the  numerical  value  /(#),  at  that 
x.  Pointwise  convergence  means  that  the  error  goes  to  zero  at  each  point. 
This  is  the  type  of  convergence  that  a  calculus  student  first  encounters. 


2.  The  mean-square  error  in  the  approximation  is  the  number  ejy  defined 

by 

nb  rb  N 

&N=  |  f(x)  -  SN(x)  \2dx=  |  f(x)  -  ^2  °nfn(x )  |2  dx. 

J  CL  J  CL 

Note  that  is  just  the  square  of  the  pointwise  error,  integrated  over  the 
interval  [a,  b\.  Thus,  the  mean-square  error  is  an  integrated  average  over  all 
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the  pointwise  errors.  We  say  that  the  infinite  series  (3.13)  converges  to  / 

in  the  mean-square  sense  if 


lim  e  tv  =  0 . 

TV— )>oo 


Mean-square  convergence  is  also  called  convergence  in  L2[a,  b\.  Thus,  we 
can  write  the  last  line  in  terms  of  the  norm  as 


lim  11/  —  *SV||2  =  0. 

TV— >•  oo 

Note  that  we  can  remove  the  square  on  the  norm,  which  is  common.  It 
requires  that  the  integrated  pointwise  error-squared  go  to  zero  as  more 
and  more  terms  are  taken.  In  advanced  courses  it  is  shown  that  pointwise 
convergence  does  not  imply  mean-square  convergence,  and  conversely. 

3.  The  uniform  error  £jy  is  the  maximum  pointwise  error  over  the  interval 
a,  6],  or 


N 


£n  =  max  |  f(x)  —  Sn(x) 

a<x<b 


max  |  f(x) 

a<x<b 


^2 Cnfn(.X ) 


n—  1 


Uniform  convergence  is  a  stronger  type  of  convergence;  it  implies  both 
mean-square  and  pointwise  convergence.  We  say  that  the  series  (3.13)  con¬ 
verges  uniformly  to  f[x)  on  [a,  b }  if 


lim  £m  =  lim  <  max  I  f(x)  —  Sm(x)  I  >  =0. 

N-> oo  oo  \a<x<b  1  V  7  ;  'j 

Thus,  for  any  given  tolerance  e  >  0,  we  can  find  a  number  N  (representing 
the  number  of  terms)  such  that  for  all  n  >  N  we  have 

max  \  f{pc)  —  Sn(x)  |<  e. 

a<x<b 

That  is,  the  error  can  be  made  uniformly  small  over  the  entire  interval  by 
choosing  the  number  of  terms  large  enough.  Uniform  convergence  depends 
only  on  the  tolerance  e  and  not  on  the  position  in  the  interval  where  the 
error  is  calculated.  [In  some  cases  “max”  must  be  replaced  “sup”  (for  supre- 
mum );  see  an  advanced  calculus  text,  e.g.,  Spivak  (2008).] 

By  the  way,  all  of  the  definitions  above  are  valid  for  series  of  functions, 
whether  or  not  the  functions  fn(x)  are  mutually  orthogonal. 
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Example  3.5 

Investigate  the  convergence  the  series 


OO 


B-U 


nx2n 


1  <  X  <  1. 


n= 0 


Here  we  can  find  the  partial  sums  explicitly.  We  have 


N 


N 


Sn{%)  =  l)nx2n  =  x2)n  = 


n— 0 


n— 0 


1  -  (~X2) 

1  +  x‘ 


2\7V+1 


where  we  have  used  the  geometric  sum 


1  +  z  +  z2  +  z3  H - h  zN  = 


1  -  zN+1 
1  -  z 


Simplifying, 


Sn(x)  = 


1  +  (-l)n+2X2('N+1') 


1 


—  1  <  X  <  1 


1  +  X2  1  +  X2  ’ 

Therefore  the  series  converges  pointwise  to  1/(1  +  x2)  on  —1  <  x  <  1. 
To  check  uniform  convergence  we  note  that  the  uniform  error  is 


1 

£n  =  max  I  Sn(x)  —  - - v 

-km  1  +  x2 


X 


2(iV+l) 


=  max 


1<x<1  1 + xA 


1 

2 


This  does  not  go  to  zero  as  iV  0,  so  the  convergence  is  not  uniform. 
For  mean-square  convergence, 


E. 


n 


sn(x) 


1 


1  +  x[ 


'1  x4AT+4 

dx  =  /  - - 7T-^dx. 


.i  (1  +  ^2): 


But,  on  the  interval  —  1  <  x  <  1  we  have  ^1+^2^2  <  1.  Therefore 

■  i 


E 


N 


■1  ™47V-f  4  /»!■  9 

dx  <  /  x4Ar+4dx  = 


(1  +  ^2): 


4W  +  5 


0,  as  TV  — oo. 


Therefore  the  series  converges  in  the  mean-square  sense.  □ 


A  novice  may  not  appreciate  subtle  differences  in  convergence  and  wonder 
why  we  have  defined  three  types  (actually,  there  are  others).  Suffice  it  to  say 
that  the  error  may  not  go  to  zero  in  a  pointwise  or  uniform  manner,  but  that 
does  not  mean  the  approximation  is  not  useful;  a  weaker  form  of  convergence, 
like  mean-square  convergence,  may  be  all  that  is  required. 

Now  suppose  (3.13)  holds  in  the  sense  of  mean-square  convergence  where  / 
is  a  given  function  in  L2  and  fn  is  a  given  orthogonal  system  in  L2.  How  can  we 
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compute  the  coefficients  cn?  The  demonstration  here  is  exactly  that  which  is 
given  in  Section  3.1  where  we  showed  how  to  compute  the  coefficients  in  a  sine 
series.  Orthogonality  is  crucial.  We  assume  in  this  argument  that  the  series  in 
(3.13)  can  be  integrated  term  by  term  (it  can  be  proved).  If  we  multiply  both 
sides  of  (3.13)  by  /m,  where  m  is  a  fixed,  but  arbitrary  index,  and  integrate 
over  the  interval  [a,  6],  we  get,  using  the  inner  product  notation, 


if,  fm) 


oo 

Y,Cnfn(x)fm(x)dx 

n— 1 


oo 


Cn  (fn  i  fm) 


n=  1 


Cm  (fmi  fm)  —  Cm||«/ra 


Note  that  the  infinite  sum  collapsed  to  a  single  term  because  of  the  orthogo¬ 
nality  of  the  fn.  So  the  cn  are  given  by 

Cn  =  n  /||2C f,fn),  n=  1,2,....  (3.14) 

II  Jn  II 

We  summarize  in  the  following  theorem. 


"heorem  3.6 


Let  /  G  L2[a,  b]  and  {/n}  be  an  orthogonal  system  of  L2  functions  on  [a,  b\.  If 


oo 

f(x)  =  YCnfn(x)  (3-15) 

n=  1 

in  the  mean  square  sense,  then  the  coefficients  cn  are  given  by  (3.14).  □ 


The  cn  are  called  the  Fourier  coefficients,  and  when  the  coefficients  are 
computed  in  this  way,  the  orthogonal  series  (3.15)  is  called  the  generalized 
Fourier  series  for  /.  In  analogy  with  finite  dimensional  vector  spaces,  like  Mn, 
the  Fourier  coefficient  cn  is  interpreted  as  the  projection  of  /  onto  the  subspace 
defined  by  fn. 


Remark  3.7 

If  the  series  expansion  (3.15)  is  valid  in  the  mean-square  sense  and  the  fn  are 
orthonormal,  then  the  Fourier  coefficients  (3.14)  are  given  simply  by 


(/,/n),  (fn  orthonormal).  □ 


(3.16) 
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There  is  another  useful  property  of  Fourier  coefficients. 

Theorem  3.8 

(Best  approximation)  Let  {cn}  be  the  set  of  Fourier  coefficients  in  the  finite 
expansion  of  /  E  L2[a,  b }  in  the  orthonormal  system  /i,  /2,  •  •  • ,  /jv-  If  {un}  is 
any  other  sequence  of  coefficients,  then 

N  N 

ii/-  yy„/„  ii2  <  ii/-  5>„/„u2.  (3.17) 

n=l  n=l 

That  is,  the  mean-square  error  is  minimized  for  the  approximation  with  the 
Fourier  coefficients  cn  given  by  (3.16).  □ 

This  result  is  called  best  approximation.  It  answers  the  question  how  we 
can  best  approximate  /  by  a  finite  linear  combination  of  the  fn.  Just  use  the 
Fourier  coefficients. 

The  demonstration  of  (3.17)  is  straightforward  using  the  definition  of  norm 
and  properties  of  the  inner  product.  We  have 

N  /  N  N  \ 

||/  -  ^  anfn\\2  =  if  ~  X!  ) 

n— 1  \  n= 1  n— 1  / 

N  N 

=  (/,/)-2^3a„(/,/n)  +  L!a2 

n— 1  n=l 

AT  N  N  N 

=  (/,/)-  2  a«c«  +  X  a«  +  53  c« _  X  c« 

n=l  n=l  n=l  n=l 

N  N 

=  (/,  /)  -  X  °2  +  XW  ~ c")2 

n=l  n=l 

N  N 

—  II  f  ^  ^  ^ nfn  ||  T  ^  ^  (^n  ^n)  • 

n—1  n=l 

Because  the  second  term  on  the  right  is  nonnegative,  (3.17)  is  valid.  If  the  fn 
are  not  orthonormal,  and  just  orthogonal,  then  the  same  result  is  true  with  the 
cn  given  by  (3.14).  □ 

In  the  last  line  of  the  proof  we  used  the  fact  that 
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which  the  reader  should  verify.  But  this  equality  leads  to  another  interesting 
inequality.  Because  the  left  side  is  nonnegative,  we  have 

N 

Ec«  -  ii/ii2- 

n= 1 

This  inequality  is  valid  for  each  TV,  and  so 

oo 

EC«<  ||/||2,  (Bessel’s  Inequality) 

n=  1 

which  is  Bessel’s  inequality.  Its  shows  that  the  series  of  squared  Fourier 
coefficients  converges,  and  thus  cn  0  as  n  —>  oo.  So  the  Fourier  coefficients 
get  smaller  and  smaller. 


Example  3.9 

(Best  approximation)  Note  that  the  set  of  functions  fn(x)  =  sin  nx  forms 

an  orthogonal  set  of  functions  on  [0,  tt]  .  Find  the  best  approximation  of  the 

2 

function  f(x)  =  8x(n  —  x)e~x  of  the  form 

ci  sin  x  +  C2  sin  2x  +  C3  sin  3x  -f  C4  sin  Ax  +  C5  sin  5x. 

We  know  that  the  cn  are  the  Fourier  coefficients  given  by 

(/,  sin  nx)  2  (' 

cn  =  - =7  =  —  /  j(x)  smnx  ax. 


sm  nx 


7 r 


Using  a  calculator  we  obtain 


Ci  —  3.708,  C2  —  4.040,  C3  —  2.230,  C4  —  0.798,  C5  —  0.295. 

The  reader  should  plot  f(x)  and  the  Fourier  approximation  on  the  interval 
[0, 7 r]  to  observe  how  much  the  two  differ.  □ 


Note  that  all  the  previous  results  assume  that  cnfn  converges  in  the 
mean  square  sense  to  /.  So  we  need  another  result  to  round  out  our  discussion: 
namely,  if  the  {fn}  form  an  orthonormal  set  and  we  form  the  generalized  Fourier 
series  (3.13)  with  cn  the  Fourier  coefficients  given  by  (3.16),  does  the  series 
automatically  converge  to  /  in  one  sense  or  another?  The  answer  to  the  question 
is  yes,  in  the  mean-square  sense,  provided  that  the  orthonormal  system  {fn} 
is  complete. 
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Definition  3.10 

An  orthonormal  system  {/n}  in  L 2  is  said  to  be  complete  if,  and  only  if,  there 
is  no  nontrivial  /  E  L2  which  is  orthogonal  to  all  the  fn.  That  is,  if  (/,  fn)  =  0 
for  n  =  1,2,3,...,  then  /  =  0.  □ 

Thus,  an  orthonormal  sequence  {/n}  is  complete  if  the  only  function  having 
all  its  Fourier  coefficients  vanish  is  the  zero  function.  Sometimes  it  is  difficult 
to  show  completeness,  and  we  shall  just  state  whether  a  given  orthonormal 
system  is  complete.  It  is  shown  in  advanced  treatments  that  completeness  is 
equivalent  to  the  property  that  any  function  /  E  L2  can  be  expanded  in  a 
generalized  Fourier  series  (3.13),  where  the  cn  are  the  Fourier  coefficients,  and 
that  the  series  converges  to  /  in  the  mean-square  sense.  Furthermore,  it  can  be 
proved  that  completeness  is  equivalent  to  strict  equality  holding  in  the  Bessel 
inequality;  i.e.,  {fn}  is  complete  if,  and  only  if,  for  each  /  E  L2, 

oo 

Y2cl  =  ii/ii2- 

n= 1 

This  equation  is  called  Parseval’s  equality. 

Pointwise  convergence  results,  or  even  stronger  uniform  convergence  results, 
are  more  difficult  to  obtain.  Typically,  for  these  these  types  of  results  continuity 
and  differentiability  conditions  on  the  function  /  are  required.  We  state  these 
results  in  the  next  section  for  orthogonal,  trigonometric  systems. 

We  can  extend  the  modal  interpretation  of  the  generalized  Fourier  series  still 
further.  We  can  think  of  /  as  a  signal  and  the  orthonormal  set  {/n}  as  fun¬ 
damental  modes.  The  Fourier  coefficient  cn  determines  the  contribution  of  the 
nth  mode,  and  the  generalized  Fourier  series  is  the  decomposition  of  the  signal 
into  fundamental  modes.  The  sequence  of  squared  coefficients,  c2,  c|,  c2, . . .,  is 
called  the  energy  spectrum,  and  c2  is  called  the  energy  of  the  nth  mode;  by 
Parseval’s  equality,  the  total  energy  in  the  signal  is  the  norm-squared,  ||/||2. 


EXERCISES 


The  trigonometric  identities 


1 


sin  a  sin  b  =  -(cos  (a  —  b)  —  cos  (a  +  b )) 
2 

1 

sin  a  cos  b  =  -(sin(a  +  b)  +  sin(a  —  6)), 

2 


along  with  others,  are  useful  in  the  calculations. 

1.  Derive  a  trigonometric  identity  similar  to  the  ones  above  for  cos  a  cos  b. 
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2.  a)  Verify  that  the  set  of  functions  1,  cosay  cos2ay  . . .  forms  an  orthogonal 

system  on  the  interval  [0,  tt  . 

b)  Verify  that  the  set  of  functions 

/ mrx\ 

cos  y  — - — J  ,  n  =  0, 1,  2  , 

form  an  orthogonal  system  on  the  interval  [0,  /]. 

c)  If 

oo 

f(x)  =  E  <V*  COS 

n= 0 

in  the  mean-square  sense  on  [0,  /],  what  is  the  formula  for  the  cn?  This 
series  is  called  the  Fourier  cosine  series  for  /  on  [0,/]. 

3.  Let  f(x)  =  0  for  0  <  x  <  1  and  f(x)  =  1  for  1  <  x  <  3. 

a)  Find  the  first  4  nonzero  terms  of  the  Fourier  cosine  series  of  /. 

b)  What  is  the  sum  of  this  series  on  0  <  x  <  3? 

c)  Why  does  the  series  converge  to  f(pc)  in  the  mean-square  sense? 

d)  Find  the  value  of  the  infinite  sum 

111111  1 
1  H-  —  —  —  —  —  +  —  +  —  —  —  —  —  +  •  •  •  . 

2  4  5  7  8  10  11 


4.  Consider  the  infinite  series 


oo 


^-^(1  —  x)xn  1 ,  0  <  x  <  1 


n—  1 


a)  Show  that  the  series  converges  pointwise.  Hint:  Find  the  partial  sums. 

b)  Show  that  the  convergence  is  not  uniform. 

c)  Show  that  the  series  converges  in  the  mean-square  sense. 


5.  Consider  the  infinite  series 


oo 

E 

n— 1 


n 


n  —  1 


1  +  n2x2  1  -f  (n  —  l)2x2  J  ’ 


0  <  x  <  1 


a)  Show  that  the  series  converges  pointwise.  Hint:  The  series  ‘telescopes.’ 

b)  Show  that  the  convergence  is  not  uniform. 

c)  Show  that  the  series  does  not  converge  in  the  mean-square  sense. 
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6.  If  cn  are  Fourier  coefficients  of  /  and  fn  is  an  orthonormal  set,  show  that 


'  N  N  \ 

E  /  -  E  Cn^n ) =  0 

Kn=l  n—  1  / 


7.  (Gram— Schmidt  orthogonalization)  We  know  from  elementary  linear 
algebra  that  any  set  of  linearly  independent  vectors  may  be  turned  into  a 
set  of  orthogonal  vectors  by  the  Gram — Schmidt  orthogonalization  process. 
The  same  process  works  for  functions  in  L2[a,  b\.  Let  /i,  /2,  /3, ...  be  an 
independent  set  of  functions  in  L2.  Define  the  set  of  functions  gn  by 


9 1  —  J 1?  92  —  12 - M  _  i|2  9 1?  93  —  J3 - 1|^||2  #2 


11^] 


(/3,^l) 

-jj - 772“^!. 


Show  that  gn  is  an  orthogonal  set. 

8.  The  functions  1  independent  functions  on  the  interval  [—1, 1] 

a)  Use  the  preceding  exercise  to  generate  a  set  of  four  orthogonal  poly¬ 
nomials  Po(x), . . . ,  P3(x)  on  [—1,1].  These  are  called  Legendre  poly¬ 
nomials. 

b)  Find  the  best  approximation  of  ex  on  [—1, 1]  of  the  form 


cqPq(x)  +  CiPi(x)  +  C2P2(^)  +  C3P3(x) 


c)  Plot  ex  and  the  approximation  on  a  set  of  coordinate  axes. 

d)  What  is  the  pointwise  error?  What  is  the  maximum  pointwise  error 
over  [—1,1].  What  is  the  mean-square  error? 

9.  For  /,  g  £  L2[a,  6],  prove  the  Cauchy— Schwarz  inequality 

I  (f,g)  |<  ll/ll  Ibl  • 

Hint:  Define  Q(t)  =  (/  +  tg,  f  +  tg)  for  any  real  number  t.  Use  the  rules  of 
inner  product  to  expand  this  expression  and  obtain  a  quadratic  polynomial 
in  t;  because  Q(t)  >  0  (why?),  the  quadratic  polynomial  can  have  at  most 
one  real  root.  Examine  the  discriminant  of  the  polynomial. 


10.  Verify  the  Cauchy-Schwarz  inequality  for  f(x) 
0  <  x  <  1. 


x  and  g(pc) 


e  x  on 


11.  Let  f,ge  L2[a,  b\.  Show  that  the  triangle  inequality  \\f  -f  g\\  <  ll/ll  +  \\g\ 
holds.  Hint:  Consider  \\f  +  g\\2. 

12.  For  which  powers  r  is  the  function  f(x)  =  xr  in  L2[0, 1]?  In  L2[l,  oo)? 
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13.  Let  /  be  defined  and  integrable  on  [0,/].  The  orthogonal  expansion 


oo 


E 

n=  1 


bn  sin 


Tl  7TX 


l 


,  bn  =  jJ  f(x )  sin 


UTTX 


l 


dx , 


is  called  the  Fourier  sine  series  for  /  on  [0,  /].  Find  the  Fourier  sine  series 
for  /(x)  =  cosx  on  [0,  tt/2]  and  plot  a  6-term  approximation.  What  is  the 
Fourier  sine  series  of  f(x)  =  sinx  on  [0,  tt]  ? 

14.  Find  the  numbers  bi,  62  ,  b3  such  that 

,  v  ,  .  7TX  .  2i7TX  .  2i7TX 

y{x)  =  Oi  sin  —  +  b2  sin  — - h  o3  sin  — — 

EE/  E 

is  the  best  approximation  to  /(x)  =  1  on  (0,  2).  Draw  a  plot  of  the  approx¬ 
imation. 

15.  Expand  the  function  f(x)  =  1  on  [0, 1]  in  a  Fourier  sine  series  and  derive 
the  interesting  formula 

7 r  11 

—  =  sin  1  -| —  sin  3  H —  sin  5  +  •  •  •  . 

4  3  5 


3.3  Classical  Fourier  Series 


In  the  last  section  we  introduced  the  concept  of  representing  a  given  function 
f(x)  in  terms  of  an  infinite  series  of  orthogonal  functions  fn(x).  Now  we  focus 
those  concepts  on  a  special  set  of  orthogonal  functions  where  the  fn  are  given 
by  sines  and  cosines.  The  resulting  series  is  called  a  (classical)  Fourier  series, 
or  trigonometric  series.  In  this  section  we  drop  the  adjective  “classical.”  To 
contrast  it  with  Fourier  cosine  and  Fourier  sine  series,  sometimes  we  refer  to  it 
a  the  full  Fourier  series. 

We  work  on  an  arbitrary  symmetric  interval  [—/,/]  about  the  origin.  If  /  is 
an  integrable  function  on  [—/,/],  then  its  Fourier  series  is 


a  0 


00 


+  E 

n—  1 


U7TX  UTTX 

an  cos  — - - F  bn  sin  — - — 


(3.18) 


where  the  Fourier  coefficients  an  and  bn  are  given  by  the  formulas 

1  T17TX 

an  =  -  f(x)  cos — - — dx,  n  —  0,1,2,..., 

*  J-i  l 


1 


4 


-  1  /./  n7TX 

bn  =  y  j  j(x)  sm  — - — ax,  n  =  l,2, 


(3.19) 


(3.20) 
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Here,  the  set  of  functions 


1 

cos  — - 
2’  l 


nnx  nnx 
,  sin 


l 


n  =  1,2, 


(3.21) 


forms  an  orthogonal  system  on  the  interval  [—/,/].  They  are  playing  the  role 
of  the  fn {x)  in  formula  (3.13),  and  the  an  and  bn  are  playing  the  role  of  the 
coefficients  cn.  It  is  shown  in  more  advanced  texts  that  the  set  of  functions 
(3.21)  is  complete,  and  therefore  the  Fourier  series  (3.18)  converges  in  the 
mean-square  sense  to  /  when  /  E  L2[— /,/]. 


Example  3.11 

Let  f(x)  =  x  for  —l  <  x  <  /.  Then  the  Fourier  coefficients  are  easily  computed 
using  integration  by  parts  to  get 


1 


T17TX 

an  =  J  I  XCOS  —j~ 


b  -1- 

L frn  — 


i 
• i 


nnx 

"n  —  i  x  sm — - — ax  =  — 
l  l_i  l  T17T 


dx  =  0,  n  =  0, 1,  2, ... , 
21 


(-l)n+1,  n=  1,2, .... 


Therefore  the  Fourier  series  for  /  is 


2 1  (  7tx  1  2ttx  1  3ttx 

—  sin  — - sin  — - 1 —  sin  — - -  • 

7T  V  l  2  l  3  l 

We  make  two  observations.  First,  at  x  =  ±1  the  series  clearly  converges  to 
zero  and  thus  does  not  converge  to  f(pc)  at  these  two  points.  So  the  series 
does  not  converge  pointwise  to  f(pc)  on  the  interval  [—/,/].  It  does,  however, 
converge  pointwise  to  f{pc)  =  x  on  (—/,/).  Second,  the  derived  series  obtained 
by  differentiating  term-by-term, 


oo 


2£(-l) 


n+1 


nnx 


cos 


n=  1 


l 


does  not  converge  at  all,  much  less  to  the  derivative  f'(x)  =  1.  (Notice  the  nth 
term  does  not  go  to  zero.)  So  the  series  cannot  be  differentiated  term  by  term. 
We  do  know  from  Section  3.2  that  the  series  converges  to  /  in  the  mean-square 
sense.  □ 

For  the  Fourier  series  (3.18)  we  define  the  nth  harmonic  to  be 


nnx  nnx 

an  cos  — - - 1-  bn  sm  — - — 
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Each  harmonic  in  the  series  has  a  higher  frequency  (and  thus  more  oscillations) 
than  the  preceding  harmonic.  The  frequency  spectrum  of  the  series  is  the 
sequence  of  numbers  yn  defined  by 


7o 


(n  >  1). 


The  frequency  spectrum  is  a  measure  of  the  contribution  of  the  various  har¬ 
monics  in  the  decomposition  of  /.  The  numbers  7^,  n  >  0,  form  the  energy 
spectrum.  The  reader  is  invited  to  show  that  Parseval’s  equality  takes  the 
form 

7  +  F  (a«  +  =  yll/ll2- 

n=  1 

The  frequency  spectrum  for  f(x)  —  x  in  the  last  example  is  70  =  0,7n  = 
2//(n7r),  n  >  1.  It  is  often  graphed,  as  in  Figure  3.2,  to  show  visually  the 
contribution  of  each  harmonic. 


frequency  spectrum 


n 


Figure  3.2 


Frequency  spectrum  for  the  function  f(pc)  =  x  on 


—7 r,  7 r 


A  Fourier  series  simplifies  considerably  if  /  is  an  even  or  an  odd  function. 
First  observe  that  sin  is  an  odd  function  and  cos  is  an  even  function; 
moreover,  an  even  function  times  an  odd  function  is  odd.  Therefore,  if  /  is 
an  even  function,  then  the  product  f(x)  sin  which  is  the  integrand  in  the 
expression  for  bn  in  (3.18),  is  an  odd  function;  so  all  of  the  coefficients  bn  are 
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zero  because  an  odd  function  integrated  over  a  symmetric  interval  about  the 
origin  is  zero.  Hence: 

If  f  is  an  even  function ,  then  its  Fourier  series  reduces  to  a  cosine  series 
having  the  form 


ao 


oo 
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nnx 


an  cos 


l  ’ 


2  / '  nirx 

an  =  —  I  f(x)  cos — - — ax,  n  —  0,1,2, 
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Similarly, 

If  f  is  an  odd  function,  then  the  coefficients  an  in  (3.18)  vanish,  and  the  Fourier 
series  reduces  to  a  sine  series 

oo 

U7TX 

.  ~ 

n—  1 

2  fl  U7TX 

bn  =  -  /  j(x)  sin — - — dx,  n  =  1,2,.... 

This  last  fact  is  illustrated  in  the  previous  example,  where  we  found  the 
Fourier  sine  series  for  f(x)  =  x,  an  odd  function. 

Periodic  functions  on  R 

Given  any  integrable  function  on  (—1,1),  we  can  always  form  its  full  Fourier 
series  (3.18).  However,  the  Fourier  series  (3.18)  itself  is  a  2/-periodic  function, 
meaning  that  it  repeats  its  values  every  interval  of  length  21.  (Recall  that  a 
function  <fi  is  P-periodic,  or  periodic  of  period  P,  if  it  repeats  itself  in  every 
interval  of  length  P;  that  is,  f(x  +  P)  =  f(x)  for  all  x.)  This  means  that  the 
Fourier  series  represents  not  only  the  function  /  on  (—/,/),  but  also  represents 
its  21  periodic  extention  to  the  entire  real  line.  The  21  periodic  extension  of 
/  is  found  by  plotting  the  graph  of  /  on  (—1,1)  and  then  translating  its  graph 
to  the  left  and  to  the  right  to  every  interval  of  length  2 /;  that  is,  to  the  intervals 
(/,  3/),  (3/,  5 /), . . . ,  (—3 1,  —  l),  (—5 Z,  —3/), - See  Figure  3.3. 

If  a  function  /  is  defined  only  on  (0,/),  then  we  can  extend  it  to  an  even 
function  on  (—1,1).  See  Figure  3.4.  This  is  called  its  even  extention.  In  this 
case,  the  full  Fourier  series  reduces  to  a  cosine  series.  If  we  extend  /  to  an  odd 
function  on  (—1,1),  the  so-called  odd  extention  (see  Figure  3.5),  then  its  full 
Fourier  series  reduces  to  a  sine  series. 
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Figure  3.3  A  function  /  defined  on  (— Z,  l)  (solid)  and  its  2Z-periodic  extention 
(dashed) 


Figure  3.4  A  function  /  defined  on  (0,  l)  (solid)  and  its  even  extention 
(dashed)  to  (— Z,  l) 


Figure  3.5  A  function  /  defined  on  (0,Z)  (solid)  and  its  odd  extention 
(dashed)  to  (— Z,  l) 
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Example  3.12 


Consider  the  2-periodic  signal  shown  in  Figure  3.6,  which  is  called  a  triangular 
wave.  Analytically  it  is  given  by  f(x)  =  x  +  1  if  —1  <  x  <  0;  f(x)  =  1  —  x  if 
0  <  x  <  1;  and  otherwise  2-periodic.  We  compute  its  Fourier  series.  Here,  /  is 
an  even  function,  and  so  bn  =  0  for  all  n.  The  coefficients  an  are  given  by 

rl 


a 


n 


J  f(x)  cos  nTrxdx 


-l 

•0 


J  (x  +  1)  cos  nTrxdx  + 


x)  cos  m rxdx. 


When  n  =  0,  we  easily  get  ao  =  1.  For  n  >  1  we  can  calculate  an  by  hand  (using 
integration  by  parts),  use  a  calculator,  or  use  a  computer  algebra  package.  The 
first  five  coefficients  are 


d\  =  0.405,  <22  =  0,  <23  =  0.045,  a 4  =  0,  <25  =  0.016. 

Therefore,  a  four-term  Fourier  approximation  to  the  triangular  wave  is 

f(x)  ~  0.5  +  0.405  cos  ttx  +  0.045  cos  3nx  +  0.016  cos  5ttx. 

Figure  3.6  shows  a  plot  of  /  and  its  approximation.  From  the  last  section  we 
know  that  this  approximation  is  the  best,  in  the  mean-square  sense.  □ 


Figure  3.6  A  triangular  wave  and  its  4-term  Fourier  expansion 
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square  wave  of  period  2*pi 


-4 


-2 


8 


Figure  3.7  Graph  of  a  27r-periodic  square  wave.  See  Exercise  1 


Remark  3.13 

(Exponential  form  of  Fourier  Series)  Using  the  fact  that  that  sine  and 
cosine  functions  may  be  written  in  terms  of  the  complex  exponential  function, 
that  is, 

cos 0  =  -  (eie  +  e~ie )  ,  sin6>  =  —  (e10  -  e~ie) , 

2  '  ’  2  iK  ’ 

it  is  evident  that  that  the  full  Fourier  series  for  a  function  /  on  (—/,/)  can  be 
written  in  exponential  form  as 


oo 


f(x )  =  E 


cne 


iUTTx/l 


n=  — oo 


It  is  not  hard  to  see  that 
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/  imrx/l  —im7rx/l^  f  ^  TYl  ^  TL 

-z  l  2/,  m  =  n 

(Note  the  minus  sign  on  the  second  factor  in  the  integral.)  This  entails  a  mod¬ 
ification  of  the  definition  of  the  inner  product  for  complex-valued  functions 
to 


(f,g)  =  j  f(x)g(x)dx, 


where  the  overbar  means  complex  conjugation.  By  previous  methods  we  find 
that  the  complex  Fourier  coefficients  are  given  by 
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cn  =  —  I  f(x)e 
2 1  J -i  J 


—  inirx 


,ld 
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The  complex  form  is  useful  in  many  calculation  involving  exponential  and 
hyperbolic  functions.  We  can  always  return  to  the  real  Fourier  series  by  isolat¬ 
ing  the  real  and  imaginary  parts  of  the  series.  □ 


Finally  we  make  some  comments  about  convergence  of  Fourier  series.  We 
already  noted  that  if  /  is  square  integrable,  then  mean-square  convergence  is 
automatic.  To  obtain  pointwise  convergence  results  some  additional  assump¬ 
tions  must  be  made  regarding  the  smoothness  of  /.  Our  assumption  is  that  the 
graph  of  /  is  made  up  of  finitely  many  smooth  pieces;  the  function  is  not  neces¬ 
sarily  continuous.  This  will  cover  most  of  the  interesting  functions  encountered 
in  science  and  engineering.  A  function  /  is  piecewise  continuous  on  [a,  b]  if 
it  is  continuous  except  possibly  at  a  finite  number  of  of  points  in  [a,  b]  where  it 
has  simple  jump  discontinuities;  /  has  a  simple  jump  discontinuity  at  x  =  c  if 
both  one-sided  limits  /(c+)  and  f(c~)  exist  and  are  finite  at  c.  The  function 
may  or  may  not  be  defined  at  a  jump  discontinuity.  We  say  /  is  piecewise 
smooth  on  [a,  b]  if  both  /  and  f  are  piecewise  continuous  on  [a,  b\.  We  say 
/  is  piecewise  smooth  on  (—00,00)  if  it  is  piecewise  smooth  on  each  bounded 
subinterval  [a,  b]  of  the  real  line.  Then  the  basic  pointwise  convergence  theorem 
is  the  following: 


Theorem  3.14 

(Pointwise  convergence)  If  /  is  piecewise  smooth  on  [—/,/]  and  otherwise 
2/-periodic,  then  its  Fourier  series  (3.18)  converges  pointwise  for  all  x  E  R  to 
the  value  f(pc)  if  /  is  continuous  at  x,  and  to  the  average  value  of  its  left  and 
right  limits  at  x,  namely  ^(f(x~)  +  /(x+)),  if  /  is  discontinuous  at  x.  □ 

To  get  stronger  convergence  results,  like  uniform  convergence,  additional 
smoothness  conditions  on  /  are  required.  Also,  continuity  of  /  is  not  enough 
to  guarantee  pointwise  convergence;  incredible  as  it  may  seem,  there  are  con¬ 
tinuous  functions  whose  Fourier  series  diverge  at  every  point!  Next  is  a  basic 
result  for  uniform  convergence  of  the  Fourier  series  for  a  given  function. 


"heorem  3.15 


(Uniform  convergence)  Let  /  be  a  continuous  function  on  the  interval  [—/,/] 
with  f(—l)  =  /(/),  and  suppose  its  derivative  is  piecewise  continuous  on  that 
interval.  Then  the  Fourier  series 
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nirx  n7rx 

an  cos  — - - 1-  bn  sin  — - — 
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converges  absolutely  and  uniformly  to  /(x)  on  that  interval.  □ 


EXERCISES 


1.  Find  the  Fourier  series  for  the  27r-periodic  square  wave  shown  in  Figure  3.7. 
Sketch  a  two-term,  a  four-term,  and  a  six-term  approximation. 

2.  Show  that  the  Fourier  cosine  series  for  sinx  on  0  <  x  <  it  is 

2  4  cos  2  nx 

■7 r  TT  / 


7T 


7 r  z '  4n2  —  1 

n— 1 


Show  graphically  the  function  represented  by  the  series  on  the  interval 
(— 7r,7r).  On  the  entire  real  line? 

3.  Find  the  Fourier  series  for  /(x)  =  x2  on 

function  defined  on  all  of  R  to  which  the  Fourier  series  converges.  Is  the 
convergence  pointwise?  Use  the  Fourier  series  to  show 

.2 


— 7r,7r].  Sketch  a  graph  of  the 


7 r“  11 

—  =  1 - F  - 

12  4  9 


1 

16 
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Graph  the  frequency  spectrum. 
4.  Derive  the  series  representation 


7T 
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oo 
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n- 


(2  n-  1): 


using  the  function  /(x) 


x 


-7T  <  X  <  7T. 


5.  Find  the  Fourier  sine  series  of  /(x)  =  tt  —  x  on  (0, 7r). 

6.  Let  /(x)  =  1,  0<x<7t/2  and  /(x)  =0,  7t/2<x<7t.  Find  the  Fourier 
cosine  series  and  the  Fourier  sine  series.  What  is  the  full  Fourier  series? 
Explicitly  characterize  the  values  of  x  E  R  where  each  converges  pointwise. 

7.  Consider  the  function  (signal)  defined  by  /(x)  =  x+1  for  —  2tt  <  x  <  0  and 
/(x)  =  x  for  0  <  x  <  27 r,  and  otherwise  47 r  periodic.  Sketch  a  graph  of  the 
signal  and  find  its  Fourier  series.  Find  and  graph  the  frequency  spectrum. 
To  what  value  does  the  Fourier  series  converge  at  x  =  0?  At  x  =  27 r?  At 
x  =  7 r?  Graph  the  sum  of  the  first  four  harmonics  and  observe  how  well  it 
approximates  /. 

8.  Show  that  for  all  x  in  (0, 1), 
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COS  7TX 
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9.  Show  that 


ex  _  Slntl  71  (~1)  cmx 


i  TL 


7 r  1  —  in 

n=  —  oo 


for  7T  <  x  <  7T.  Write  the  series  in  real  quantities. 

10.  Show  that  the  Fourier  series  for  /(x)  =  cosax  on  [ — tt,  tt] ,  where  a  is  not 
an  integer,  is 
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2asina7r  /  1 
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cos  nx 
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Show 
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11.  Let  /(x) 


I  on 


7T  <  x  <  0  and  /(x)  =  A  on  0  <  x  <  tt.  Show  that  the 


Fourier  series  for  /  is 
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2 

(2  n  —  1)7 r 


sin(2n 


If  s jv(x)  denotes  the  sum  of  the  first  TV  terms,  sketch  a  graph  of  si(x), 
53 (x),  57 (x),  and  sio(x)  and  compare  to  /(x).  Observe  that  the  approx¬ 
imations  overshoot  /(x)  in  a  neighborhood  of  x  =  0,  and  the  overshoot 
is  not  improved  regardless  of  how  many  terms  are  taken  in  the  approxi¬ 
mation.  This  overshoot  behavior  of  Fourier  series  near  a  discontinuity  is 
called  Gibbs  phenomenon.  The  series  does  converge  pointwise  to  /(x) 
on  (—7 r,  0)  and  on  (0, 7r.  At  x  =  0  it  converges  pointwise  to  the  average 
of  the  left  and  right  limits,  or  zero.  It  does  not  converge  uniformly  on  any 
interval  containing  x  =  0. 

12.  Show  that  the  Fourier  sine  series  of  /(x)  =  100  on  [0,  /]  is 


4 

100 


1  1 

sin(7rx//)  H —  sin(37 rx/Z)  H —  sin(57rx/Z)  + 
3  5 


Using  software,  plot  the  first  50  terms  of  the  series  on  the  scaled  interval 
0  <  x/l  <  1.  Observe  Gibbs  phenomena  at  the  points  x/l  =  0  and  x/l  =  1, 
and  calculate  the  percentage  of  the  maximum  overshoot  at  those  two  points. 
(Answer:  about  9%.) 

13.  Find  the  complex  form  of  the  full  Fourier  series  for  coshx  on  (—1,1).  Find 
the  real  form. 


4 

Partial  Differential  Equations 

on  Bounded  Domains 


This  chapter  treats  a  standard  and  important  method  called  separation  of 
variables,  or  the  method  of  eigenfunction  expansions,  for  solving  partial 
differential  equations  on  bounded  spatial  domains.  This  method,  due  to  Fourier 
in  the  early  1800s,  with  contributions  by  Euler  and  D’Alembert  in  the  late 
1700s,  is  fundamental.  In  fact,  many  textbooks  emphasize  this  method  over  all 
others  because  of  its  extensive  applications  in  physics  and  engineering  where 
most  problems  are  on  bounded  spatial  domains.  In  a  nutshell,  the  essential 
feature  of  the  method  is  the  replacement  of  the  partial  differential  by  a  set  of 
ordinary  differential  equations  which  then  are  solved  subject  to  given  initial 
and  boundary  conditions.  To  fix  the  idea,  assume  the  unknown  function  in  the 
PDE  is  a  u  —  u(x,t),  where  the  independent  variables  are  x  and  t.  We  make 
the  assumption  that  u  can  be  written  as  a  product  of  a  function  of  x  and  a 
function  of  t,  that  is, 

u(x,t)  =  y(x)g{t). 

If  the  method  is  to  be  successful,  when  this  product  is  substituted  into  the  PDE, 
the  PDE  separates  into  two  ODEs,  one  for  y(x)  and  one  for  g(t).  Substitution 
of  the  product  into  the  boundary  conditions  leads  to  boundary  conditions  on 
the  function  y(x).  Therefore,  we  are  faced  with  a  spatial  ODE  boundary  value 
problem  for  y(x)  and  a  temporal  ODE  problem  for  g(t).  When  the  equations  for 
y(x)  and  g(t)  are  solved,  we  can  form  a  product  solution  u(x,t)  =  y{x)g(t)  of 
the  PDE  that  satisfies  the  boundary  conditions.  The  boundary  value  prob¬ 
lem  we  obtain  for  y{x)  is  a  special  type  of  eigenvalue  problem  called  a 
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Sturm-Liouville  problem  and  it  has  infinitely  many  solutions.  Consequently,  we 
will  have  infinitely  many  product  solutions  u\(x,t),  U2(x,  £),  U3  (x,  £),...  that 
satisfy  the  boundary  conditions.  By  superimposing  these  solutions,  or  adding 
them  up  in  a  special  way,  we  determine  a  solution  of  the  PDE  and  boundary 
conditions  that  also  satisfies  the  initial  condition(s).  In  other  words,  we  form 
the  series 

u(x,  t)  =  c\U\(x,  t)  +  C2U2 (x-)  t)  +  03^3(0;,  £)  +  ••• 

and  choose  the  constants  cn  such  that  the  sum  satisfies  the  initial  condition(s) 
as  well.  The  result  of  the  calculation  is  an  infinite  series  representation  of  the 
solution  to  the  the  original  initial  boundary  value  problem  for  the  PDE. 

Section  1  amplifies  the  preceding  remarks  to  fully  motivate  the  method. 
Then,  Section  2  presents  the  basics  of  Sturm-Liouville  eigenvalue  problems,  and 
the  final  sections  of  the  chapter  treat  many  important  examples  that  occur  in 
engineering  and  physics.  Many  references  discuss  these  methods  in  detail;  par¬ 
ticularly  we  mention  Churchill  (1963),  and  subsequent  editions,  Strauss  (1992), 
Farlow  (1993),  and  Haberman  (2013). 


4.1  Overview  of  Separation  of  Variables 

In  this  first  section  we  work  through  detailed  examples  of  Fourier’s  method  in 
the  contexts  of  heat  transfer  and  wave  motion. 

Example  4.1 

We  revisit  the  basic  idea  of  Fourier  that  we  discussed  in  Section  3.1,  which 
motivated  the  study  of  orthogonal  expansions  and  Fourier  series.  Consider  the 
initial  boundary  value  problem  for  heat  conduction, 

Ut  =  Uxx,  0  <  X  <  7T,  t  >  0,  (4.1) 

u(0,  t)  =  u(tt,  t)  =  0,  t>  0,  (4-2) 

u(x,0)  =  f(x),  0  <  X  <  7T.  (4.3) 

The  separation  of  variables  method  consists  of  looking  for  solutions  in  the  form 
of  products,  i.e., 

u(x,t)  =  y(x)g{t). 

Substituting  this  product  into  the  PDE  (4.1)  and  boundary  conditions  (4.2), 
we  obtain 


y{x)g'{t)  =  y"(x)g(t ),  y(0)g(t)  =  0,  y(n)g(t)  =  0. 
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Because  g(t)  is  not  the  zero  function,  we  can  write  the  ODE  and  boundary 
conditions  as 


g'{t)  =  y"(x) 
g(t )  y{x) 


-A,  y(0)  =  y(  tt)  =  0, 


for  some  yet  to  be  determined  constant  A.  This  is  valid  because  the  only  way 
a  function  of  t  can  equal  a  function  of  x  for  all  t  and  #,  is  if  both  are  equal 
to  the  same  constant.  The  yet  unknown  constant  —A  is  called  the  separation 
constant;  placing  a  minus  sign  on  the  constant  is  for  convenience  and  does  not 
mean  that  it  is  negative.  Therefore,  we  obtain  an  ordinary  differential  equation 
for  g  in  the  time  domain,  namely, 


=  -A  g(t), 


and  we  obtain  a  boundary  value  problem  for  y  in  the  spatial  domain,  namely, 

—y"(x)  =  A y(x),  0  <  x  <  7r,  (4-4) 

2/(0)  =  0,  y( 7r)  =  0.  (4.5) 

Therefore,  the  PDE  problem  separated  into  two  ODE  problems,  one  for  g(t) 
and  one  for  y(x).  We  can  easily  solve  the  the  DE  for  g\  leaving  out  the  arbitrary 
multiplicative  constant,  its  solution  is 

g{t)  =  e~xt. 


Next  we  solve  the  boundary  value  problem,  which,  as  we  later  observe, 
is  a  special  case  of  a  Sturm-Liouville  problem:  It  is  obvious  that  y{x)  =  0 
is  a  solution  to  this  problem;  but  we  are  interested  in  nontrivial  solutions. 
The  question  is:  for  which  values  of  the  separation  constant  A  are  there 
nontrivial  solutions ?  (For  example,  if  A  =  2  then  the  general  solution  is 
y(x)  =  Aco$y/2x  +  B  sin\/2a:  which  is  not  zero  at  x  =  0  and  x  =  n  unless 
A  =  B  =  0.)  One  way  to  determine  such  values  of  A  is  to  write  the  general 
solution  of  (4.4)  and  go  through  a  case  argument  by  separately  considering 
A  =  0,  A  0,  and  A  <  0.  We  examine  different  cases  because  the  general  solu¬ 
tion  of  (4.4)  has  a  different  form  depending  on  the  sign  of  A.  (We  prove  later 
that  A  cannot  be  a  complex  number.)  If  A  =  0,  then  the  ODE  (4.4)  has  the 
form  y"  =  0,  whose  general  solution  is  the  linear  function  y{x)  =  Ax  +  B.  But 
the  boundary  condition  y(0)  =  0  implies  B  =  0,  and  then  y( tt)  =  0  implies 
A  =  0.  Hence,  in  this  case  we  get  only  the  uninteresting  trivial  solution  and  so 
A  =  0  is  not  a  possible  value.  If  A  <  0,  say  for  definiteness  A  =  —a2,  then  the 
ODE  (4.4)  has  the  form  y"  —  a2y  =  0,  which  has  a  general  solution 


y(x)  =  Aeax  +  Be~ax 
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The  boundary  conditions  (4.5)  force 

2/(0)  =  A  +  B  =  0, 

y(  7r)  =  Aea7r  +  Be~a7T  =  0. 

It  is  easy  to  see  that  these  two  equations  for  A  and  B  force  A  =  B  =  0,  and 
again  we  obtain  only  the  trivial  solution.  Thus,  we  must  have  A  >  0,  or  A  =  a2. 
In  this  case  the  ODE  (4.4)  takes  the  form 

y"  +  a2y  =  0, 


which  has  solutions  built  from  sines  and  cosines: 

y(x)  =  A  cos  ax  A  B  sin  ax. 

First,  7/(0)  =  0  forces  A  =  0.  Thus  y{x)  =  B  sin  ax.  The  right-hand  boundary 
condition  yields 

y( 7r)  =  B  sinmr  =  0. 

But  now  we  are  not  required  to  take  B  —  0;  rather,  we  can  select  a,  and  hence 
the  unknown  A,  to  make  this  equation  hold.  Clearly,  because  the  sine  function 
vanishes  at  multiples  of  7r,  we  have  a  =  n,  a  nonzero  positive  integer,  or 

2 

A  —  A fi  —  n  ,  n  —  1, 2, .... 

These  are  the  values  of  A  that  lead  to  nontrivial  solutions 


y  =  yn{pc)  =  sinnx,  n  =  l,2, .... 


Here  we  arbitrarily  selected  the  constant  B  =  1;  but  we  can  always  multiply 
this  solution  by  any  constant  to  get  another  (not  independent)  solution. 

These  special  values  of  A  for  which  (4.4-4. 5)  has  a  nontrivial  solution  are 
called  the  eigenvalues  for  the  problem  and  the  corresponding  solutions  yn(x) 
are  called  the  eigenfunctions.  For  example,  A  =  1  has  corresponding  eigen¬ 
function  yi(x)  =  sinx,  A  =  4  has  corresponding  eigenfunction  yi(x)  =  sin2x, 
and  so  on.  Observe  further  that  the  eigenfunctions  form  an  orthogonal  system 
on  the  interval  [0,  tt]  ;  the  eigenfunctions  we  found  are  the  basis  of  a  Fourier 
sine  series  studied  in  the  last  chapter. 

Next  we  return  to  the  time  equation  which  has  solutions  g(t)  =  ext .  Sub¬ 
stituting  the  known  values  of  A  =  An  gives 


9n(t ) 


-n2t 


n  =  1,2,.... 


Therefore,  product  solutions  to  the  PDE  and  boundary  conditions  (4. 1-4.2) 
are 

un(x,  t)  =  gn(t)yn(x)  =  e  sin  nx,  n—  1,2,.... 
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To  find  a  solution  that  also  satisfies  the  initial  condition  (4.3),  we  use  super¬ 
position  and  form  the  linear  combination 

oo 

u(x,  t)  =  bne~n  1  sin  mg  (4.6) 

n—  1 

where  the  bn  are  arbitrary  constants.  Equation  (4.3)  implies 

oo 

u(x,  0)  =  f(x)  =  bn  sin  nx. 

n—  1 

The  right  side  is  the  Fourier  sine  series  for  the  initial  function  f(x)  in  terms  of 
the  orthogonal  functions  sin  nx\  From  Chapter  3  we  conclude  that  the  coeffi¬ 
cients  bn  are  the  Fourier  coefficients  given  by 

2 

bn  =  —  /  f(x)sinnxdx.  (4.7) 

n  Jo 

In  summary,  the  solution  to  initial  boundary  value  problem  (4.1),  (4.2),  (4.3) 
is  given  by  the  infinite  series  (4.6)  where  the  bn  are  given  by  (4.7).  □ 

This  is  the  method  of  separation  of  variables,  and  it  applies  to  most  all  of 
the  problems  on  bounded  domains  in  this  text.  For  the  method  to  work: 

1.  The  boundary  conditions  must  be  homogeneous. 

2.  The  PDE  must  be  homogeneous. 

Later  in  this  chapter  we  indicate  how  to  deal  with  nonhomogeneous  problems. 

Example  4.2 

We  apply  the  procedure,  but  with  less  detail,  to  the  initial  boundary  value 
problem  for  the  heat  equation 


ut  =  kuxx ,  0  <  x  <  /,  t  >  0,  (4.8) 

u(0,t)=0,  u(l,t)  =  0,  t  >  0,  (4.9) 

u(x,0)  =  f(x),  0  <  x  <  Z,  (4-10) 

where  we  have  included  a  diffusivity  k  in  the  PDE  and  the  spatial  interval  is 
[0,  /]  instead  of  [0,  tt  . 

Step  one  is  to  assume  a  solution  of  the  form  u(x,  t )  =  y(x)g(t)  and  substitute 
it  into  the  PDE  (4.8)  and  boundary  conditions  (4.9).  Substituting  into  the  PDE 
gives 


y(x)g'(t )  =  ky"(x)g(t), 
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or,  upon  dividing  by  kyg , 

g'(t)  =  y"(x) 
kg{t)  y{x)  ' 

Notice  that  the  variables  in  the  equations  have  been  separated — the  left  side  is 
a  function  of  t,  and  the  right  side  is  a  function  of  x.  Now,  when  can  a  function 
of  t  be  equal  to  a  function  of  x  for  all  x  E  (0,  /)  and  all  t  >  0?  Only  if  the  two 
functions  are  equal  to  the  same  constant,  which  again  we  call  —A.  That  is,  we 
must  have 

g'(t)  =  y"(x)  = 

kg(t)  ’  y(x) 

Therefore,  we  have  two  ODEs,  for  g  and  y: 

g'(t)  =  —Xkg(t),  -y"{x)  =  A  y(x). 

Also  observe  we  placed  the  constant  k  in  the  g  equation;  it  could  be  placed 
in  the  y  equation,  but  it  is  convenient  to  keep  the  y  equation  simple.  Next  we 
substitute  u(x,t)  =  y{x)g{t)  into  the  boundary  conditions  (4.9)  to  obtain 

y(0)g(t)  =  0,  y(l)g(t)  =  0. 

Excluding  the  uninteresting  possibility  that  g(t)  =  0,  we  get  2/(0)  =  0  and 
y(l)  =  0.  Therefore,  we  are  led  to  the  boundary  value  problem 

—y"(x)  =  \y{pc),  0  <  x  <  /,  (4T1) 

2/(0)  =  0,  2/(/)  =  0.  (4.12) 

This  is  an  ODE  boundary  value  problem  for  y.  Exactly  as  in  Example  4.1,  the 
eigenvalues  A  are  positive.  When  A  =  a2,  the  problem  becomes 

y”  +  a2y  =  0,  y{ 0)  =  0,  y{l)  =  0. 


The  general  solution  is 


y(x)  =  A  cos  ax  +  B  sin  ax. 


Now  2/(0)  =  0  forces  A  =  0  and  then  y(l)  =  0  forces  B  sin  al  =  0.  We  do  not 
want  to  take  B  =  0,  so  we  get  al  =  utt,  or  a  =  nn/l.  Therefore,  the  eigenvalues 
are 

77/27T2 

x  =  xn  =  — n  =  1,2,...,  (4.13) 

and  the  corresponding  solutions,  or  eigenfunctions,  to  the  BVP  are 


Vn{x) 


TiTTX 

sm 


n  =  1,2, 


(4.14) 


l 


1 
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To  reiterate,  the  eigenvalues  are  those  values  of  A  for  which  the  problem  (4.11- 
4.12)  has  a  nontrivial  solution;  the  eigenfunctions  are  the  corresponding  non¬ 
trivial  solutions. 

The  next  step  is  to  solve  the  time  equation  for  g(t).  We  easily  get 

g  (t)  =  —Xkg(t)  =^>  g(t)  =  e~Xkt  =  e~n  71  kt ^  . 


Now  we  put  together  the  preceding  results.  We  have  constructed  infinitely  many 
product  solutions  of  the  PDE  (4.8)  having  the  form 


un(x,t)  =  e 


_  —n2ir2kt/l 


,2  TiTTX 

sin  — - — 


n 


1  2 

_L  }  AJ  y  •  •  •  • 


These  product  solutions  also  clearly  satisfy  the  boundary  conditions  (4.9),  but 
they  do  not  satisfy  the  given  initial  condition  (4.10).  So  the  next  step  is  to 
determine  the  constants  cn  such  that  the  linear  combination 


oo 

u(x,  t )  =  cne 

n=  1 


n27r2kt/l2 


T17TX 

sin  — - — 


(4.15) 


satisfies  the  initial  condition  (4.10).  Thus  we  require 


oo 

u(x,  0)  =  f(x)  =  ^2  °n  sin 

n—  1 


But  the  right  side  is  just  the  Fourier  sine  series  of  the  function  f(x)  on  the 
interval  (0,/).  Therefore,  the  coefficients  cn  are  the  Fourier  coefficients  given 
by  (see  Section  3.3) 


c 


n 


nnx 

j(x)  sm — - — ax,  n  =  l,2, .... 

L 


(4.16) 


Therefore,  we  have  obtained  a  solution  to  (4.8-4.10)  given  by  the  infinite  series 
(4.15)  where  the  coefficients  cn  are  given  by  (4.16).  If  the  function  /  is  com¬ 
plicated,  then  a  simple  formula  for  the  Fourier  coefficients  cn  cannot  be  found, 
and  we  must  resort  to  numerical  integration.  □ 


Remark  4.3 

We  can  obtain  a  particularly  insightful  formula  for  the  solution  in  Example  4.2. 
Substituting  the  expression  for  the  cn  into  the  solution  formula  (4.15)  allows 
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us  to  write  the  solution  in  a  different  way  as 


u(x,  t) 


£  G  L m 


sin  dt;  )  e  n‘^ktll 


l 


2 _ 2  7„j.  /  ,2  n 7TX 

sin  — - — 


0 


'  '  jjr  e~Mt/12  sin  Hi  sin  |  /(£) 


n=l 


1 


1 


=  f  g(x,€,t)f(€)d£. 
J  0 


The  function  g{pc,  £,  t)  is  the  expression  in  parentheses  and  is  called  the  Green’s 
function  for  the  problem.  This  formula  shows  the  form  of  the  solution  as 
an  integral  operator  acting  on  the  initial  temperature  distribution  function  /. 
Similar  to  the  discussion  in  Chapter  2  on  the  diffusion  equation,  we  can  regard 
g(x,£,t)f(£)  as  the  temperature  response  at  x,  at  time  £,  of  the  system  due  to 
a  local  source  /(£)  at  the  point  £  at  time  t  =  0;  then  the  formula  above  adds  up 
the  contribution  of  all  of  the  sources  in  the  interval.  In  this  way,  the  function 
#(#,£,£)  can  be  interpreted  as  a  source  function,  or  the  response  to  a  ‘point 
source.’  This  may  remind  more  advanced  readers  of  delta  functions.  □ 


Once  a  series  representation  for  the  solution  is  found,  the  work  is  not  over 
if  we  want  to  find  temperature  profiles  or  the  solution  surface.  Because  we 
cannot  sum  an  infinite  series,  we  often  take  the  first  few  terms  as  an  approx¬ 
imation.  Such  approximations  are  illustrated  in  an  example  below  and  in  the 
exercises.  For  the  heat  equation,  taking  a  two  or  three  term  approximation  is 
very  accurate  because  of  the  decaying  exponential  in  the  series  (4.15). 


Example  4.4 

Consider  the  initial  boundary  value  problem  for  the  heat  equation: 

rq  —  'U'xx'j  0  x  <C  1,  t  0, 
u(Q,t)  =  u(l,t)  =  0,  t  >  0, 
u(x,  0)  =  10x3(l  —  x),  0  <  x  <  1. 

Here,  k  =  /  =  1  and  the  initial  temperature  is  f(x)  =  10x3(l  —x).  The  solution 
is  given  by  (4.15)  with  the  coefficients  (4.16)  in  Example  4.2.  Specifically, 

cn  =  20  /  x3(l  —  x)  sin(mnr)  dx. 

J  0 

Using  a  computer  algebra  system  or  calculator,  one  finds 


ci  =  0.7331,  C2 


0.4838,  c3  =  0.1304. 
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A  3-term  approximate  solution  is  therefore 

u(x,  t)  ~  cie_7r  1  sin(7rx)  +  C2e~47T  1  sin(27nr)  +  c^e~9lT  1  sin(37nr). 
Figure  4.1  plots  four  time  profiles  showing  how  the  bar  cools  down.  □ 


Figure  4.1  Temperature  profiles  in  Example  4.4  at  different  times.  As  t  0 
the  temperature  approaches  the  steady  state  solution,  u  =  0 


The  separation  of  variables  procedure  described  above  can  be  imitated  for  a 
large  number  of  problems.  Next  we  illustrate  the  method  for  the  wave  equation, 
again  with  less  detail. 

Example  4.5 

Recall  that  the  wave  equation  models  the  small  deflections  of  an  elastic  string 


fixed  at  both  ends  with  an  initial  position  and  velocity  given  at  time  t  =  0.  We 
consider  the  problem 

utt  =  c2uxx,  0<x<l,t>0,  (4-17) 

i/(0,  t)  =  0,  u(l,t)  =  0,  t  >  0,  (4-18) 

u(x,Q)  =  F(x),  Ut(x,  0)  =  G(x),  0  <  x  <  /,  (4.19) 
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where  F  and  G  are  the  initial  position  and  velocity  of  the  string.  Assuming 
u(x,t)  =  y(x)g(t)  and  substituting  into  the  PDE  (4.17)  yields 


V{x)g"{t)  =  c2y"{x)g{t ), 


or,  upon  dividing  by  c2yg. 


9"(t)  y"(x) 


c2g(t) 

Setting  each  term  equal  to  a  constant 


y(x)  ' 

A,  we  obtain  the  two  ODEs 


g  ( t)  +  c  A g(t)  =  0,  -y  (. x )  =  A y(x). 

Next  we  substitute  u(x,t)  =  y{x)g(t)  into  the  boundary  conditions  (4.18)  to 
get 

y(o)g(t )  =  o,  y(l)g(t)  =  o, 

which  gives  y(0)  =  0  and  y{l)  =  0.  Therefore,  we  are  led  to  the  boundary  value 
problem 

-y"{x)  =  A y(x),  0  <  x  <  l;  y( 0)  =  0,  y(l)  =  0. 


This  boundary  value  problem  is  exactly  the  same  as  in  the  heat  flow  example 
above.  The  eigenvalues  and  eigenfunctions  are  (see  (4.13-4.14)) 


Art  — 


n2  7r2 


l 2 


,  yn(x)  =  sin 


Tl  7TX 


,  n  =  1,2.... 


(4.20) 


Solving  the  ODE  for  g(t ), 


/  \  /  x  ^  nnct  ^  nirct 

g(t)  =  gn{t)  =  C  sm  — ; - b  D  cos 


l 


l 


Therefore,  we  have  constructed  infinitely  many  product  solutions  of  the  PDE 
(4.17)  having  the  form 

,  N  (  nnct  nnct  \  nnx 

i^n(x,  t)=[cn  sm  — - - b  dn  cos  — - —  1  sm  — - — ,  n  =  1,2,..., 


I 


l 


where  we  have  replaced  the  arbitrary  constants  C  and  D  by  arbitrary  constants 
cn  and  dn  depending  on  n.  These  product  solutions  un(x,  t)  represent  modes  of 
vibrations.  The  temporal  part  is  periodic  in  time  with  period  2 l/(nc);  the  spa¬ 
tial  part  has  frequency  2 l/n.  These  product  solutions  also  satisfy  the  boundary 
conditions  (4.18)  but  do  not  satisfy  the  given  initial  conditions  (4.19).  So  we 
form  the  linear  combination 


oo 


U 


(x,t)  = 


nnct  nnct  \  nnx 

cn  sin  — - b  dn  cos  — : —  I  sin 


n— 1 


i 


i  J 


i 


(4.21) 
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and  select  the  constants  cn  and  dn  such  that  the  initial  conditions  hold.  Thus 
we  require 


oo 

a(x,  0)  =  F(x)  =  dn  sin 

n=  1 


T17TX 

~r 


The  right  side  is  the  Fourier  sine  series  of  the  function  Fix)  on  the  interval 
(0,1).  Therefore,  the  coefficients  dn  are  the  Fourier  coefficients  given  by 


d 


n 


2 

7 


0 


TITTX 

F[x)  sm  — - — ax, 

i 


To  apply  the  other  initial  condition  rq(x,  0) 
time  derivative  of  u(x,t).  We  obtain 


n  —  1,2,....  (4.22) 

G(x)  we  need  to  calculate  the 


TlTTCt 

cos  — - — 


7  nnct  \  nnx 

dn  sm  — - —  1  sm  — - — 

L  J  i 


Thus 


iq(x,  0)  =  G(x)  = 


E 


ncn 

~l  cn  sm 


T17TX 

l 


Again,  the  right  side  is  the  Fourier  sine  series  of  G  on  (0,  /),  so  the  coefficients 
of  sin  which  are  I1jLcn,  are  the  Fourier  coefficients;  that  is, 


ncn 


n(  \  •  nnx 
G(x)sm — - — ax, 
l  .1  o  l 


or 


Therefore,  the  solution  of  the  initial  boundary  value  problem  is  given  by  the 
infinite  series  (4.21)  where  the  coefficients  are  given  by  (4.22)  and  (4.23). 

Again,  numerical  calculations  on  a  computer  algebra  system  or  a  scien¬ 
tific  calculator  can  be  used  to  determine  a  few  of  the  coefficients  to  obtain  an 
approximate  solution.  However,  for  the  wave  equation  several  terms  may  be 
required  for  accuracy  because,  unlike  the  heat  equation,  there  is  no  decaying 
exponential  in  the  solution  (4.21)  to  cause  rapid  decay  of  the  terms  in  the  series. 


d 


ncn 


o 


G(x)  sin  ^-^dx,  n  =  l,2, 

i 


(4.23) 


□ 


What  we  have  described  is  the  separation  of  variables  method,  and  the  pro¬ 
cedure  can  be  imitated  on  a  large  number  of  initial  boundary  value  problems. 
Each  problem  leads  to  a  boundary  value  problem  for  an  ODE  that  has  infinitely 
many  solutions  that  form  an  orthogonal  system.  In  the  next  section  we  focus 
attention  on  the  ODE  boundary  value  problem,  called  a  Sturm-Liouville  prob¬ 
lem. 
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EXERCISES 

1.  Solve  the  heat  flow  problem  with  both  boundaries  insulated: 

Ut  —  kuxx'j  0  <C  x  <C  /,  t  0, 
ux(0,t)  =  ux(l,t)  =  0,  t  >  0, 

37 TX 

u(x ,  0)  =  1  +  2  cos  — — ,  0  <  x  <  l. 

L 

2.  In  the  heat  flow  problem  (4. 1-4.3)  take  k  =  1,  l  =  7r,  and  f(x)  =  0 
if  0  <  x  <  7t/2,  f(pc)  =  1  if  7t/2  <  x  <  i r.  (a)  Find  an  infinite  series 
representation  of  the  solution,  (b)  Use  the  first  four  terms  in  the  series  to 
obtain  an  approximate  solution,  and  on  the  same  set  of  coordinate  axes 
sketch  several  time  snapshots  of  the  approximate  temperature  distribution 
in  the  bar  in  order  to  show  how  the  bar  cools  down,  (c)  Estimate  the  error 
in  these  approximate  distributions  by  considering  the  first  neglected  term, 
(d)  What  is  limt^0+  u(tt/2 ,  £)?  Comment. 

3.  In  the  vibration  problem  (4.17-4.19)  take  c  =  1,  l  =  7r,  and  F(x)  =  x  if 
0  <  x  <  7t/2,  and  F(x)  =  tt  —  x  if  tt/2  <  x  <  7r,  and  take  G(x)  =  0.  (a) 
Find  an  infinite  series  representation  of  the  solution,  (b)  Use  the  first  four 
terms  in  the  series  to  obtain  an  approximate  solution,  and  on  the  same  set 
of  coordinate  axes  sketch  several  time  snapshots  of  the  wave,  (c)  Can  you 
estimate  the  error  in  these  approximate  wave  forms? 

4.  The  initial  boundary  value  problem  for  the  damped  wave  equation, 

Utt  +  kut  =  c2uxx,  0  <  X  <  1,  t  >  0, 
n(0,t)  =  0,  n(l,t)=0,  t  >  0, 
u(x,  0)  =  /(#),  Ut(x,  0)  =0,  0  <  x  <  1, 

governs  the  displacement  of  a  string  immersed  in  a  fluid.  The  string  has 
unit  length  and  is  fixed  at  its  ends;  its  initial  displacement  is  /,  and  it 
has  no  initial  velocity.  The  constant  k  is  the  damping  constant  and  c  is 
the  wave  speed.  Use  k  =  2,  c  =  1  and  apply  the  method  of  separation  of 
variables  to  find  the  solution. 

5.  Consider  heat  flow  in  a  rod  of  length  l  where  the  heat  is  lost  across  the 
lateral  boundary  is  given  by  Newton’s  law  of  cooling.  The  model  is 

Ut  =  kuxx  —  hu ,  0  <  x  <  Z, 
u  =  0  at  x  =  0,  x  =  /,  for  alH  >  0, 
u  =  f(x)  at  t  =  0,  0  <  x  <  /, 

where  h  >  0  is  the  heat  loss  coefficient. 
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a)  Find  the  equilibrium  temperature. 

b)  Solve  the  problem. 


4.2  Sturm-Liouville  Problems 

In  this  section  we  examine  a  broad  class  of  initial  boundary  value  problems  of 
the  form 


=  (p(x)ux)x  —  q(x)u ,  a  <  x  <  b,  t  >  0, 

(4.24) 

a\ u(a,  t )  +  a2ux(a ,  t)  =  0, 

(4.25) 

P\u(b,t)  +  P2ux(b,t)  =  0, 

(4.26) 

u(x,  0)  =  /(#),  a  <  x  <  b. 

(4.27) 

The  standard  boundary  conditions  (i.e.,  Dirichlet,  Neumann,  and  radiation) 
are  included  in  (4.25-4.26)  as  special  cases  by  choosing  appropriate  values  of 
the  constants. 

When  we  assume  a  separable  solution  u(x,  t)  =  y(x)g(t ),  we  obtain  an  ODE 
for  y  =  y(x)  of  the  form 

-(p(x)y')'  +  q{x)y  =  Ap,  a  <  x  <  6, 

where  A  is  the  separation  constant  (see  Exercise  1).  This  type  of  ODE  is  called 
a  Sturm-Liouville  differential  equation.  The  homogenous  boundary  con¬ 
ditions  on  the  PDE  transform  into  boundary  conditions  on  y ,  namely, 

otiy(a)  +  a2y\a)  =  0,  pi  y(b)  +  /32y'(b)  =  0. 

Solving  this  important  boundary  value  problem  for  the  function  y{x)  on  a 
bounded  interval  [a,  b }  is  the  subject  of  a  large  portion  of  this  section  and  the 
basis  for  the  separation  of  variables  method.  We  state  the  problem  in  a  general 
form. 


Definition  4.6 


Let  p,  p',  and  q  be  continuous  functions  on  the  bounded  interval  [a,  6],  and 
p(x)  0  for  all  x  E  [a,  fo] ;  assume  further  that  a\  +  7^  0  and  ^  0. 

A  regular  Sturm-Liouville  problem  (SLP)  is  the  problem  of  determining 
values  of  the  constant  A  for  which  the  boundary  value  problem 


-(p(x)y')'  +  q(x)y  =  A p,  a  <  x  <6, 

OL\y(a)  +  a2  y\a)  =  0, 

Pi  y(b)  +  @2  y\b)  =  0. 


(4.28) 

(4.29) 

(4.30) 
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has  a  nontrivial  solution  y  =  y{x).  By  a  solution  we  mean  a  function  that  is 
continuous  and  has  a  continuous  derivative.  We  refer  to  these  types  of  problems 
as  (differential)  eigenvalue  problems.  □ 


If  the  interval  [a,  b]  is  infinite,  or  if  p(x o)  =  0  for  some  Xo  E  [a,  b]  the  SLP  is 
called  singular.  In  most  problems,  p  is  positive.  Different  types  of  boundary 
conditions  also  lead  to  singular  problems.  SLPs  are  named  after  J.  C.  F.  Sturm 
and  J.  Liouville,  who  studied  them  in  the  mid  1830s.  The  condition  that  the 
constants  a\  and  are  not  both  zero,  and  the  constants  /3iand  /?2  are  not  both 
zero,  guarantees  that  a  boundary  condition  does  not  collapse  at  an  endpoint. 

As  we  saw  in  Example  4.1,  a  regular  SLP  need  not  have  a  nontrivial  solution 
for  some  values  of  the  constant  A.  (Clearly,  the  zero  function  y(x)  =0  is  always  a 
solution,  but  there  is  no  interest  in  it).  A  value  of  A  for  which  there  is  a  nontrivial 
solution  of  (4.28-4.30)  is  called  an  eigenvalue,  and  the  corresponding  nonzero 
solution  is  called  an  eigenfunction.  Observe  that  any  constant  multiple  of  an 
eigenfunction  is  also  an  eigenfunction  corresponding  to  the  same  eigenvalue;  but 
it  is  not  independent.  The  important  fact  about  regular  SLPs  is  that  they  have 
an  infinite  number  of  eigenvalues,  and  the  corresponding  eigenfunctions  form  a 
complete,  orthogonal  set,  which  makes  orthogonal  expansions  possible.  This  is 
a  key  idea  in  applied  mathematics,  and  it  especially  applies  to  the  separation 
of  variables  method,  which  is  the  origin  of  the  entire  subject  of  Fourier  series 
and  orthogonal  expansions. 

Before  proving  some  general  facts  about  regular  SLPs,  we  give  an  additional 
example  with  mixed  boundary  conditions.  In  this,  and  subsequent  examples, 
we  write  y  for  t/(x),  y'  for  t/(x ),  etc.,  because  the  independent  variable  x  is 
understood. 


Example  4.7 

Consider  the  regular  SLP 

-y"  =  A y,  0  <  x  <  1, 

2/7(0)  =  0,  2/(1)  =0. 

We  carry  out  a  case  argument  to  determine  the  possible  eigenvalues.  Exactly  as 
in  previous  examples,  we  find  that  A  cannot  be  negative  or  zero.  If  A  =  a2  >  0 
then  the  general  solution  of  the  ODE  is  y{x)  =  A  cos  ax  +  B  sin  ax.  Now, 
t/(0)  =  aB  =  0,  giving  B  =  0;  hence  y(x)  =  A  cosax  and  7/(1)  =  A  cos  a  =  0. 
There  is  no  need  to  take  A  =  0,  so  we  take  cos  a  =  0,  and  thus  a  must  be 
7r/2  +  77/ 7T,  n  =  0, 1,  2, ... .  Therefore  the  eigenvalues  are 

An=(f  +  mr)  =  (^(2n  +  1))  ’  n  =  0’1,2,.... 
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The  corresponding  eigenfunctions  are 

yn{x)  =  cos  ^(2n  +  ,  n  =  0, 1,  2, - 

One  can  verify  that  the  eigenfunctions  form  an  orthogonal  system  on  the  inter¬ 
val  [0, 1].  □ 


We  can  always  carry  out  a  case  argument  as  in  the  preceding  example 
to  determine  the  eigenvalues,  their  signs,  and  corresponding  eigenfunctions. 
For  many  equations  this  can  be  tedious,  and  so  it  is  advantageous  to  have 
some  general  results  that  allow  us  to  reject  or  affirm  certain  cases  immediately, 
without  detailed  calculation.  This  is  discussed  in  the  next  section.  For  the 
present  we  state  the  central  result  regarding  regular  SLPs. 


"heorem  4.8 


For  a  regular  Sturm-Liouville  problem  (4.28-4.30)  the  following  hold: 

1.  There  are  infinitely  many  discrete  eigenvalues  An,  n  —  1,2, . . .,  and  the 
eigenvalues  are  real  with  hmrWOO  |An|  =  Too. 

2.  Eigenfunctions  corresponding  to  distinct  eigenvalues  are  orthogonal. 

3.  An  eigenvalue  can  have  only  a  single  independent  eigenfunction. 


4.  The  set  of  all  eigenfunctions  yn[x)  is  complete  in  the  sense  that  every 
square-integrable  function  /  on  [a,  b }  can  be  expanded  in  a  generalized 
Fourier  series 

(VnJ) 


OO 


f  (x)  =  ^2  cnVn{x) ,  Cn 


n=  1 


II  Un 


12  ’ 


which  converges  to  /  in  the  mean-square  sense  on  [a,  b].  □ 


It  is  beyond  our  scope  to  prove  the  existence  of  the  eigenvalues  (see,  for 
example,  Birkhoff  and  Rota  (1989)).  But  it  is  straightforward  to  demonstrate 
(2)  and  (3).  First  we  introduce  some  notation  that  streamlines  writing  the 
formulas. 

Notation.  Let  L  denote  the  second-order  differential  operator  defined  by  the 
left  side  of  the  Sturm-Liouville  differential  equation: 

Ly  =  —(p(x)y')'  +  q(x)y ,  a  <  x  <  b. 

(It  is  common  to  write  Ly  and  not  L(y).)  So  the  SLP  can  be  written  simply  as 

Ly  =  A y,  a  <  x  <  6,  (4-31) 

ai  y(a)  +  a2  y'(a)  =  0,  (4.32) 

Pi  yip)  +/W(&)  =  0.  (4.33) 
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We  make  liberal  use  of  the  inner  product  of  two  functions: 


(yi,V2) 


Vi  {x)y2{x)dx. 


a 


Finally,  we  introduce  the  Wronskian  of  two  functions  ^1,^2,  defined  by 


W\y1,y2](x)  =  yi(x)y'2(x)  -  y[(x)y2(x). 


Notice  the  Wronskian  is  a  function  of  x,  and  we  usually  write  it  simply  as 
W(x)  if  the  two  functions  are  understood.  The  reader  may  be  familiar  with 
this  quantity  from  an  elementary  differential  equations  course. 

The  following  lemma  contains  two  extremely  important  identities  that  are 
used  in  the  calculations. 


Lemma  4.9 


Let  yi  and  1/2  be  two  C 1  functions  on  the  interval  [a,  b\.  Then, 

d 

V2Lyi  -  y\Ly2  =  —  [p(x)W(x)_ . 

This  identity  is  called  Lagrange’s  identity.  It  follows  that 

O2,  LVl)  -  (yi ,Ly2)  =  [p(x)W(x)]  \ba=  p(b)W(b)  -  p(a)W(a) 


This  is  Green’s  identity.  □ 


The  proof  of  Lagrange’s  identity  is  straightforward.  Just  take  the  derivative 
of  p(x)W(x)  and  verify,  using  the  product  rule,  that  it  coincides  with  the  left 
side: 


d_ 

dx 


\p(x)W(x)  = 


~T  pyiU2  ~r 

dxL  J  dx 

(py^Yvi+pyWi  -  {py[)'y2  -py[y2 

(py^'yi  -  (py'i)'y2 

-{py'i)'y2  +  qy\V2  +  (py2)'yi  -  qym 

[  -  (py'i)'  +  qyi]y2  -  [ -  (py2)'  +  qy2]yi  =  2/2-O/1  -  yiLy2. 


To  prove  Green’s  identity  integrate  both  sides  of  Lagrange’s  identity  over  [a,  b } 
and  use  the  inner  product  notation.  One  can  also  prove  Green’s  identity  directly 
using  integration  by  parts.  □ 

Now  we  demonstrate  the  proofs  of  various  parts  of  Theorem  4.8. 
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Orthogonality  of  eigenfunctions.  Let  Ai  and  A2  be  distinct  eigenvalues  of 
the  SLP  (4.31-4.33)  with  corresponding  eigenfunctions  yi(x)  and  y2(x).  By 
Green’s  identity, 

(yi,Ly±j  -  ( yi,Ly2 )  =  [y2,  Ayi)  -  (yi,\y2) 

=  (Ai  -  A2)  (2/1,  y2j 
=  p(b)W(b)-p(a)W(a). 

Now  we  calculate  W (a)  and  W ( b )  using  the  boundary  conditions.  Because  both 
y  1  and  y2  satisfy  the  first  boundary  condition  (4.32),  we  have 

OL\y\{a)  +  &2y[(a)  =  0, 

042/2(0)  +  ^2y'2(a)  =  0. 

These  expressions  can  be  regarded  as  a  homogenous  linear  system  in  two 
unknowns  aq  and  a2: 

{  yi(a)  y'lip)  W  «i  \  =  /  0 
V  Viia)  y2(a)  J  \  a2  J  v  0 

By  assumption,  a\  and  a2  are  not  both  zero.  Hence,  the  determinant  of  the 
coefficient  matrix  must  be  zero.  That  is,  yi(a)y2(a)  —  y[(a)yi(a)  =  W(a)  =  0. 
Similarly  W(b)  =  0.  Therefore, 

(Ai  —  A2^  [yi.y^  =  0. 

Because  Ai  7^  A2,  we  have  (yi,  y2^j  =  0,  which  means  yi  and  y2  are  orthogonal. 

□ 


The  eigenvalues  are  real.  Let  A  be  an  eigenvalue  with  corresponding  eigen¬ 
function  y.  Then  Ly  =  A y  and  y  satisfies  the  boundary  conditions  (4.32-4.33). 
Taking  the  complex  conjugate  of  the  differential  equation  we  get 

Ly  =  Xy 


and  the  comlex  conjugate  y  satisfies  the  boundary  conditions  as  well  (with  y 
replaced  by  y).  This  means  A,  y  is  also  an  eigenpair.  As  above,  the  Wronskian 
is  zero  at  x  =  a  and  x  =  b.  Applying  Green’s  identity  then  gives 


0  =  (y,Ly)  -  ( y,Ly ) 


(' y ,  A y)  -  ( y ,  A y)  =  (A  -  A )(y,  y) 


y{x)y{x)dx  =  /  |  y{x)  |2  dx. 


a 


a 


Because  y  is  not  identically  zero,  the  last  integral  is  positive.  So  A  =  A.  Because 
A  equals  its  complex  conjugate,  it  must  be  real.  □ 


172 


Bounded  Domains 


An  eigenvalue  A  has  a  unique  eigenfunction,  up  to  a  constant  mul¬ 
tiple.  Let  yi  and  7/2  be  two  eigenfunctions  associated  with  the  eigenvalue  A. 
Then  Ly\  =  \y\  and  Ly 2  =  A  7/2-  By  Lagrange’s  identity, 


0  =  V2Lyi  -  yiLy2 


d 


dx  . 


p(x)W  (x) 


This  means  p(x)W(x)  =  c  =  constant.  We  can  evaluate  the  constant  by  either 
of  the  boundary  conditions.  For  example, 


p(a)W  (a)  =  c  =  0 

because,  as  we  showed,  the  boundary  condition  forces  W(a)  =  0.  Hence, 
W(x)  =  2/12/2  —  2/i 2/2  =  0  for  all  x  in  [a,  b\.  This  is  the  same  as 

I  (®)  -  »'  <«4> 

or  yi  =  constant  •  2/2  •  Hence  y\  and  2/2  are  dependent  functions.  □ 

The  eigenfunctions  are  real.  Let  y  be  an  eigenfunction  corresponding  to  the 
real  eigenvalue  A.  If  y  is  complex- valued,  then  y  =  <p(x)  +  i'ip( x ),  where  </>  and 
ip  are  real-valued  functions.  By  the  linearity  of  L  and  the  boundary  conditions, 
it  follows  that  both  <p  and  ip  are  eigenfunctions  corresponding  to  A.  Then  <p 
and  ip  are  linear  dependent.  So  we  can  replace  y  with  one  of  these  real- valued 
eigenfunctions.  □ 


Example  4.10 

(Periodic  boundary  conditions)  Consider  the  simple  boundary  value  prob¬ 
lem 


— y"  =  A ?/,  —7 r  <  x  <  7r, 

y(-ir)  =  y{ w),  y'(-7r )  =  y'( tt). 


This  is  not  a  regular  SLP  because  the  boundary  conditions,  called  periodic 
boundary  conditions ,  are  of  a  different  type.  Nevertheless,  it  is  easy  to  check 
that  the  eigenvalues  are  A n  =  n2,  n  =  0,1,2,....  Corresponding  to  A  =  0  is 
a  single  eigenfunction  7/0  =  1,  and  corresponding  to  any  n  >  0  there  are  two 
linearly  independent  eigenfunctions,  cos  nx  and  sin  nx.  This  does  not  contradict 
the  theorem.  One  can  note  that  (4.34),  required  in  the  proof  of  uniqueness 
of  an  eigenfunction,  cannot  hold  in  this  case.  However,  it  remains  true  that 
p(x)W(x)  | =  0.  Hence  the  eigenvalues  are  real  and  the  corresponding  set  of 
eigenfunctions  form  a  complete  orthogonal  set.  This  set  of  eigenfunctions  form 
the  basis  of  the  classical  Fourier  series 


1 

2 


00 

ao + 


1 


a 


n 


cos  nx  +  bn  sin  nx. 


□ 
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Another  important  question  concerns  the  sign  of  the  eigenvalues.  We  can 
always  carry  out  a  case  argument  as  in  previous  examples,  but  often  an  energy 
argument  gives  an  answer  more  efficiently,  as  in  the  following  example. 


Example  4.11 

(Energy  argument)  Consider  the  Sturm-Liouville  differential  equation 

-y"  +  q(x)y  =  A y,  0  <  x  <  /, 
with  q(x)  >  0  and  mixed  boundary  conditions 

2/(0)  =0,  y'(l)  =  0. 


Let  A  be  an  eigenvalue  with  corresponding  eigenfunction  y  =  y(x).  Multiply 
the  differential  equation  by  y  and  integrate  to  get 

/  —yy"dx-\-  /  q(x)y2dx  =  A  /  y2(x)dx. 

Jo  Jo  Jo 

This  method  gets  its  name  from  the  expression 

d 


[  y2(x)dx  =  \\y\\2, 
JO 


which  is  the  energy  of  the  function  y.  (See  Section  3.3.)  Now  integrate  the  first 
integral  by  parts,  pulling  one  derivative  off  y"  and  putting  it  on  y: 

—yy"  dx  =  -y(x)y’{x)  \l0+  ( y’  fdx . 

Jo 

y(x)y'(x)  |o  +  /  (y')2  dx  +  /  q(x)y2dx  =  A  /  y2(x)dx . 

Jo  Jo  Jo 


Then 


'0  JO  JO 

The  boundary  conditions  force  the  boundary  term  to  be  zero,  and  therefore 

/  (y')2  dx  +  /  q(x)y2dx  =  A  /  y2(x)dx. 

Jo  Jo  Jo 

The  left  side  is  strictly  positive,  and  the  integral  on  the  right  is  positive.  Hence, 
A  >  0.  This  problem  has  only  positive  eigenvalues.  □ 


A  similar  argument  using  integration  by  parts  can  be  made  for  the  general 
Sturm-Liouville  problem  (4.28-4.30).  The  reader  can  show,  as  above,  that  if  A 
and  y  are  an  eigenvalue-eigenfunction  pair,  then 


fa  ( p(x)y ,2  +  q(x)y2  )dx  -  p(x)y(x)y'{x ) 


A  = 


b 

a 


\\y\ 


(4.35) 
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This  expression  for  the  eigenvalue  A  is  called  the  Rayleigh  quotient.  The 
boundary  term  in  the  numerator  involving  the  Wronskian  may  be  simplified  by 
the  SLP  boundary  conditions  (4.29-4.30). 

Example  4.12 

Consider  the  SLP 


y"  =  A y,  0  <  x  <  1, 

2/(0)  =0,  3/(1)  H-/33/i(l)  =  0,  (3>  0. 


Here  p(x)  =  1  and  q(x)  =  0.  Therefore,  if  A  and  y  are  an  eigenpair,  then  the 
Rayleigh  quotient  gives 


y,2dx  —  y(x)y\x) 

w 


1 

o 


But 


-y(l)y\l)  +  y(0)y'(0)  =  Py'(l)2  >  0. 
Hence,  the  eigenvalues  are  positive.  □ 


Next  we  solve  a  problem  with  a  radiation-type  boundary  condition. 


Example  4.13 

Consider  the  diffusion  problem 

Ut  =  kuxx ,  0  <  x  <  1,  t  >  0,  (4.36) 

u(0,  t)  =  0,  u(l,  t)  +  ux{l,  t)  =  0,  t  >  0,  (4.37) 

u(x,  0)  =  /(x),  0  <  x  <  1,  (4.38) 

where  the  left  end  is  fixed  and  the  right  end  satisfies  a  radiation-type  condition; 
note  that  if  u  >  0  at  x  =  1  then  ux  <  0  at  x  =  1,  so  heat  is  radiating  out  of 
the  bar.  We  assume  u(x,t)  =  y(x)g(t)  and  substitute  into  the  PDE  to  obtain 

g'(t)  =  y"  0)  = 

kg(t)  y{x) 

where  —A  is  the  separation  constant.  This  gives  the  two  ODEs 

g'(t)  =  -A  kg(t),  -y"(x)  =  A  y{x). 

Substituting  the  product  u  —  yg  into  the  boundary  conditions  gives 


y(0)g(t)  =  0,  y{l)g{t)  +  y'(l)g(t)  =  0. 
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Since  g(t)  ^  0,  we  are  forced  to  take  y( 0)  =  0  and  y(l)  +  y'(l)  =0,  and  we  are 
left  with  the  Sturm-Liouville  problem 

-y"  =  Xy,  0  <  x  <  1;  j/(0)  =  0,  2/(1) +  2/'(l)  =  0. 

We  know  from  Example  4.12  that  the  eigenvalues  are  positive.  For  A  >  0  the 
solution  to  the  ODE  is  y(x)  =  A  cos  x/~\x  +  B  sin  \f\x.  The  boundary  condition 
y(0)  =0  forces  A  =  0,  and  the  boundary  condition  y(  1)  +  y'(l)  =  0  forces  the 
condition 

sin  v/A  + v7!  COS  Vx  =  o, 


or 


VX  =  —  tan  vX. 


This  equation,  where  the  variable  A  is  tied  up  in  a  trigonometric  function, 
cannot  be  solved  analytically.  But  plots  of  the  functions  VX  and  —  tan  \X 
versus  A  (see  Figure  4.2)  show  that  the  equation  has  infinitely  many  positive 
solutions  Ai,  A2, . . represented  by  the  intersection  points  of  the  two  graphs. 
Numerically,  we  find  Ai  =  4.115858,  A2  =  24.13934,  A3  =  63.65911. 


Figure  4.2  Graphical  representation  of  the  first  three  eigenvalues  of  the  SLP 
at  the  intersection  of  y  =  —  tan(vX)  and  y  =  \/(A) 


These  values  An  are  the  eigenvalues  for  the  Sturm-Liouville  problem,  and 
the  corresponding  eigenfunctions  are 

2 In  =  sin  \/XaX,  n  =  1,2,.... 

By  the  Sturm-Liouville  theorem  we  know  that  the  eigenfunctions  yn(%)  are 
orthogonal  on  0  <  x  <  1,  which  means  that 

1 

sin  x/X^x  sin  yj A mxdx  =  0  for  m  ^  n. 
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The  solution  to  the  time  equation  is 


g(t)  =  ce 


Xkt 


and  since  there  are  infinitely  many  values  of  A,  we  have  infinitely  many  such 
solutions, 

9n(t)  =  cne~Xnkt. 

Summarizing,  we  have  obtained  infinitely  many  product  solutions  of  the  form 


cne  Xnkt  sin  n  =  l,2.... 

These  functions  will  solve  the  PDE  (4.36)  and  the  boundary  conditions  (4.37). 
We  superimpose  these  to  form 


oo 


U 


Or,  t)  =  J2 


cne 


’  kt 


sin  vai, 


(4.39) 


n—  1 


and  we  select  the  coefficients  cn  such  that  u  will  satisfy  the  initial  condition 
(4.38).  We  have 


oo 


U 


Or,  0)  =  /  (x)  =  ^  c„  sin  vVr. 


ri—1 


This  equation  is  an  expansion  of  the  function  f(x)  in  terms  of  the  eigenfunctions 
yn(x).  We  know  from  Chapter  3  that  the  coefficients  cn  are  given  by  the  formula 


1 


— 


I \n 


2 

nil  JO 


/  / (a;)  sin  VA  nx  dx,  n  =  1,2 

Jo 


or 


1 


c 


n 


Jo1  sin2  y/Xjxdx  Jo 


/  f(x)  sin  x/XnX  dx,  n  =  1,2,.... 

Jo 


(4.40) 


Consequently,  the  solution  of  (4.36-4.38)  is  given  by  (4.39),  where  the  coeffi¬ 
cients  cn  are  given  by  (4.40). 

Using  MATLAB  to  calculate  a  three-term  approximation  to  the  solution 
in  the  case  k  =  0.1  with  initial  temperature  f(x)  =  10x(l  —  x),  we  find  the 
coefficients  c i,  C2,  cs  are  given  by 


ci  =  2.13285,  c2  =  1.040488,  c3  =  -0.219788. 

Therefore,  a  three-term  approximate  solution  is 

u(x,t)  «  2.13285e_4,115858t  sin  V4.115858X 
+  1.040488e“24  13934t  sin  V24.13934x 
-  0.219788e-63'659m  sin  \/63.65911or.  □ 
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We  end  this  section  with  some  summarizing  comments  and  terminology. 
For  the  regular  Sturm-Liouville  problem  (4.31-4.33),  Green’s  identity 


yi,Ly2)  -  (y2,Lyi)  =p 


y\y2  -ViWi 


b 

a 


was  an  essential  tool  in  obtaining  results  concerning  eigenvalues,  orthogonality 
of  eigenfunctions,  and  so  on.  As  we  showed,  the  boundary  conditions  have  the 
property  that  for  any  two  functions  y\ ,  1/2  satisfying  the  same  set  of  boundary 
conditions  (4.32-4.33), 


P 


y  m 


y'm 


(4.41) 


In  this  case  we  say  the  boundary  conditions  are  symmetric.  Whenever  we 
have  symmetric  boundary  conditions  for  a  regular  SLP,  it  is  clear  that 


(yi,Ly2)  =  (y2,Ly1). 


When  this  property  holds  with  corresponding  symmetric  boundary  conditions, 
we  say  that  the  differential  operator  L  is  symmetric,  or  self-adjoint. 


Example  4.14 

We  showed  that  the  differential  operator  Ly  =  —y" ,  a  <  x  <  6,  with  peri¬ 
odic  boundary  conditions  is  self-adjoint.  However,  one  can  check  (an  exercise) 
that  the  operator  L  =  —  (p(x)y')r,  a  <  x  <  b  is  not  self-adjoint  with  periodic 
boundary  conditions  unless  p(a )  =  p(b).  □ 


EXERCISES 

1.  Show  that  substitution  of  u(x,t)  =  g(t)y{x)  into  the  PDE 

ut  =  (p(x)ux)x  ~  c[(x)u,  a  <  x  <  b,  t  >  0, 
leads  to  the  pair  of  differential  equations 

9' it)  =  -A git),  ~(p(x)yry  +  q(x)y  =  A y, 
where  A  is  some  constant. 


2.  Show  that  the  SLP 


y"(x)  =  X y(x),  0  <  x  <  /, 

2/(0)  =  0,  y(0  =0, 
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with  mixed  Dirichlet  and  Neumann  boundary  conditions  has  eigenvalues 

(1  +  2u)tt  \  2 

21  ) 

and  corresponding  eigenfunctions 

(1  +  2n)7TX 
21 

for  n  —  0, 1,  2, . . .. 

3.  Consider  the  SLP 

-y"  =  \y,  0  <  x  <  1;  2/(0)  +  2/'(0)  =  0,  2/(1)  =  0. 

Is  A  =  0  an  eigenvalue?  Are  there  any  negative  eigenvalues?  Show  that 
there  are  infinitely  many  positive  eigenvalues  by  finding  an  equation  whose 
roots  are  those  eigenvalues,  and  show  graphically  that  there  are  infinitely 
many  roots. 

4.  Show  that  the  SLP 

-y"  =  \y,  0<£<2;  y(0)  +  2y'(0)  =  0,  3t/(2)  +  2/(2)  =  0, 

has  exactly  one  negative  eigenvalue.  Is  zero  an  eigenvalue?  How  many  pos¬ 
itive  eigenvalues  are  there? 

5.  For  the  SLP 

-y"  =  A y,  0  <x  <1;  y{ 0)  -  ay'{ 0)  =  0,  y(l)  +  by'(l)  =  0, 
show  that  A  =  0  is  an  eigenvalue  if  and  only  if  a  +  b  =  —  l. 

6.  What  can  you  say  about  the  sign  of  the  eigenvalues  for  the  SLP 

-y"  +  xy  =  -\y,  0  <  x  <  1,  2/(0)  =  2/(1)  =  0. 

Use  a  computer  algebra  package  to  find  the  eigenvalues  and  eigenfunctions. 
Hint:  Look  up  Airy’s  differential  equation. 

7.  Consider  the  regular  SLP 

-y"  +  q(x)y  =  A y,  0  <  x  <  /, 

2/(0)  =  2/(0  =  0, 

where  q(pc)  >  0  on  [0,  /].  Show  that  if  A  and  y  are  an  eigenvalue  and  eigen¬ 
function,  respectively,  then 

/o  ( y /2  +  qy2fx 

A  • 

y  2 

Is  A  0?  Can  y(af)  —  constant  be  an  eigenfunction? 
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8.  Does  the  boundary  value  problem 

—y"  =  A y,  a  <  x  <  b, 
y(a )  =  2/(6),  y'(6)  =  2y'(b), 

have  symmetric  boundary  conditions?  Is  it  self-adjoint? 

9.  Find  the  eigenvalues  and  eigenfunctions  for  the  following  problem  with 
periodic  boundary  conditions: 

—y"(x)  =  A y(x),  0  <  x  <  Z, 

y( o)  =  y(l),  y'( o)  =  y'(l). 

10.  Consider  a  large,  circular,  tubular  ring  of  circumference  21  that  contains 
a  chemical  of  concentration  c(x,  t)  dissolved  in  water.  Let  x  be  the  arc- 
length  parameter  with  0  <  x  <  21.  See  Figure  4.3.  If  the  concentration 


of  the  chemical  is  initially  given  by  f(x),  then  c(x,t)  satisfies  the  initial 
boundary  value  problem 

ct  =  Dcxx,  0  <  x  <2 Z,  t  >  0, 
c(0,  t)  =  c(2l,t),  cx(0,t)  =  Cx(2l,t),  t  >  0, 
c(x,  0)  =  /(x),  0  <  x  <  21. 

These  boundary  conditions  are  called  periodic  boundary  conditions, 
and  D  is  the  diffusion  constant.  Apply  the  separation  of  variables  method 
and  show  that  the  associated  Sturm-Liouville  problem  has  eigenvalues 
An  =  ( nn/l )2  for  n  =  0,1,2,...  and  eigenfunctions  yo(x)  =  1  ,yn(x)  = 
An  cos(n7rx/l )  +  Bn  sm(mrx/l)  for  n  =  1,  2, . . ..  Show  that  the  concentra¬ 
tion  is  given  by 

A  °°  222 

c(x,  t )  =  — -  +  cos(n7rx//)  +  Bn  sin(n7rx/l))e~n  n  Dt !l 

2  n— 0 
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and  find  formulas  for  the  An  and  Bn. 

11.  Find  eigenvalues  and  eigenfunctions  for  the  singular  problem 

— y"  +  x2y  =  A p,  x  G  R;  y  G  L2(M). 

Use  a  computer  algebra  system  if  needed. 

12.  Find  an  infinite  series  representation  for  the  solution  to  the  wave  problem 

Utt  —  C  U xxi  0  X  <C  /,  t  0, 

u(0,t)  =  ux(l,t)  =  0,  t>  0, 

u(x,  0)  =  /(#),  iq(:r,0)  =0,  0  <  x  <  l. 

Interpret  this  problem  in  the  context  of  waves  on  a  string. 

13.  Find  an  infinite-series  representation  for  the  solution  to  the  heat  absorption- 
radiation  problem 


Uf  —  ^xxi  0  x  <C  1,  t  0, 
ux(0,  t)  —  ao^(0,  t)  =  0,  ux(l,  t)  +  aiix(l,  t)  =  0,  t  >  0, 

'u(ayO)  =  /(#),  0  <  x  <  1, 

where  ao  <  0,  ai  >  0,  and  ao  +  ai  >  —  a^a\.  State  why  there  is  radiation  at 
x  =  1,  absorption  at  x  =  0,  and  the  radiation  greatly  exceeds  the  absorp¬ 
tion.  Choose  ao  =  —0.25  and  a i  =  4  and  find  the  first  four  eigenvalues; 
if  f(x)  =  x(l  —  x),  find  an  approximate  solution  to  the  problem  and  plot 
either  the  approximate  solution  surface  or  time  snapshots. 


4.3  Generalization  and  Singular  Problems 

A  straightforward  generalization  of  a  regular  Sturm-Liouville  problem  is  the 
SLP 


Lwy  =  -(p(x)y')'  +  q(x)y  =  A r(x)y,  a  <  x  <  b,  (4.42) 

otiy{a)  +  a2  y'{a)  =  0,  (4.43) 

Pi  y(b)  +  p2y'(b )  =  0,  (4.44) 

which  now  includes  a  positive,  continuous  weight  function  r  =  r(x)  on  the 


right  side  of  the  differential  equation.  Here,  the  same  conditions  on  p,  <7,  and 
the  boundary  conditions  hold  as  before.  Many  physical  processes  lead  to  this 
type  of  equation  after  separating  variables. 
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The  Sturm-Liouville  theorem  holds  exactly  as  before,  but  with  a  different 
definition  of  orthogonality.  We  define  the  weighted  inner  product  of  two 
functions  by 

pb 


2/i, 3/2 )  =  /  yi(x)y2(x)r(x)dx. 


W 


This  leads  to  the  weighted  norm 


(2/1, 2/2 )  • 

The  conclusions  of  Theorem  4.8  are  the  same  but  with  the  fact  that  the  eigen¬ 
functions  are  orthogonal  with  repect  to  the  weighted  inner  product.  Moreover, 
the  eigenfunctions  yn(x),  n  =  1,2,3,...  form  a  complete  orthogonal  set  of 
functions  on  [a,  6];  that  is,  if  /  is  square  integrable  in  the  weighted  norm,  then 

00 

f(x)  =  ^2cnyn(x), 

n— 1 


in  the  mean  square  sense,  where 


c 


n 


Example  4.15 

Consider  the  eigenvalue  problem 

-y"  +  y'  =  A?/,  0  <  x  <  1, 

2/(0)  =  2/(1)  =  0. 

As  it  stands,  this  problem  is  not  in  self-adjoint  form,  so  we  cannot  apply  The¬ 
orem  4.8  directly.  However,  if  we  multiply  by  the  integrating  factor  e~x ,  the 
equation  becomes 

j'  =  A  e~xy, 

which  is  of  regular  Sturm-Liouville  type,  with  weight  function  r(x)  =  e~x. 
One  can  use  an  energy  argument,  or  case  argument,  to  easily  show  that  the 
eigenvalues  are  positive  and  are  given  by 

i  1  22 

An  =  -  +  n  it  ,  n  —  1,2,... 

with  eigenfunctions 


e  xy 


yn{%)  =  sinmnr,  n  =  1,2,.... 
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The  eigenfunctions  are  orthogonal  with  respect  to  weight  function  e 

(dmdm)  =  /  ex/2  smriTrx  -  ex/2  smriTrx  e~xdx 
\  J  W  Jn 


-x , 


=  /  smnnxsmriTrx  dx  =  0. 
o 


Therefore,  if  /  is  square  integrable,  then  we  can  write 


oo 


/  o)  =  Y, 


cne  x  sinn7rx, 


n= 1 


in  the  mean  square  sense,  where  cn 


f,Vn 


c 


w 


n  —  ,  \  2 

dm  dn  ) 

/  u> 


,  n  =  1,2,3,....  □ 


Remark  4.16 

A  differential  equation  of  the  form 

yn  +  a(x)?/  +  6(x)?/  =  A?/ 

can  be  put  into  self-adjoint  form  my  multiplying  by  the  integrating  factor 

p(x)  =  . 

Then,  note  y'(x)  =  g(x)a(x),  and  the  differential  equation  becomes 

00)?/ )'  +  p(x)b(x)y  =  A  p(x)y, 


or 


(/jb(x)y  )  +  q(x)y  =  \/ji(x)y,  q(x)  =  fi(x)b(x).  □ 


Singular  Problems 

Singular  problems  occur  for  the  SLP 

Lwy  =  -(p(x)y')'  +  q(x)y  =  Xr(x)y ,  a  <  x  <  b,  (4.45) 

ai2/(a)  +  cx2d\o)  =  0,  (4.46) 

^i2/(fe)  +  /W(&)  =  o,  (4.47) 

when  p(#o)  =  0  at  some  point  xo  in  the  interval  [a,  b].  Here  we  assume  either 
p(a)  =  0  or  p(fr)  =  0.  In  the  case  p(a)  =  0  we  generally  replace  the  boundary 
condition  at  x  =  a  by  the  condition  that  y(a)  is  bounded. 

8  If  p(x)  =  0  at  an  interior  point  in  the  interval,  then  that  point  is  called  a  turning 
point  Turning  points  are  examined  in  more  advanced  treatments. 
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Example  4.17 

The  singular  SLP 


—  (xy'Y  =  A xy,  0  <  x  <  R, 
2/(0)  bounded,  y(R)  =  0 


occurs  in  the  initial  BVP  for  the  heat  equation  in  a  circular  disk  of  radius  R. 
This  is  discussed  in  detail  in  Section  4.6.  The  differential  equation  is  called 
Bessel’s  equation.  We  show  here  that  this  problem  is  self-adjoint.  Denoting 
Ly  =  —  {xy')r  and  letting  y\  and  2/2  be  two  functions  that  satisfy  the  boundary 
conditions,  by  Green’s  identity  we  get 


x 


yi(x)y’2{x)  -  y[(x)y2(x) 


R 

0 


=  -(o)  2/1  (0)2/2  (0)  —  3/1  (0)2/2  (0) 


So,  L  is  self-adjoint.  Therefore  the  results  of  the  Sturm-Liouville  theorem  hold: 
there  are  infinitely  many  discrete  eigenvalues  and  the  eigenfunctions  are  orthog¬ 
onal  with  weight  function  r(x)  =  x.  Refer  to  Section  4.6.  □ 


Example  4.18 

Consider  the  singular  SLP 


-y"  =  A y,  x>  0, 

2/(0)  =  0,  y(x)  bounded. 

The  problem  is  singular  because  the  interval  is  infinite.  It  is  clear  that  A  >  0 
because  negative  A  gives  exponential  solutions  which  cannot  satisfy  the  bound¬ 
ary  conditions;  zero  cannot  be  an  eigenvalue  for  the  same  reason.  The  general 
solution  is  y{x)  =  A  sin  \Z~\x-\-B  cos  yf\x.  The  left  boundary  condition  2/(0)  =  0 
forces  B  —  0.  We  are  left  with  y(x)  =  A  sin\/Ax.  Therefore,  the  bounded  solu¬ 
tions,  or  eigenfunctions,  are 

y\(x)  =  suia/A,  A  >  0, 

and  the  eigenvalues  are  all  positive  values  of  A;  this  is  a  continuous  set  of 
eigenvalues,  rather  than  discrete.  In  this  case  there  is  no  classical  orthogonality 
property  on  (0,  00).  However,  there  is  a  continuous  version  of  a  Fourier  expan¬ 
sion.  If  /  is  piecewise  continuous  on  x  >  0,  and  if  /  is  absolutely  integrable, 
or 

r  00 


0 


f(x)  |  dx  <  00, 
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then  we  can  represent  /  by  a  continuous  version  of  a  Fourier  sine  series,  namely, 


•oo 


7 r 


F(a)  sin  ax  da , 


o 


(4.48) 


where  we  have  let  a  =  \/A;  it  can  be  shown  that  the  coefficient  function  F(a) 
is  given  by 

POO 

F(a)  =  /  f(x)sinaxdx.  (4.49) 

J  o 

The  function  F(a)  in  (4.49)  is  called  the  Fourier  sine  transform  of  /  and 
is  commonly  denoted  by  S(f).  It  acts  in  a  similar  way  to  other  transforms 
where  derivative  operations  are  turned  into  multiplication  operations.  Fourier 
sine  transforms  are  convenient  for  solving  differential  equations  on  0  <  x  <  oo. 
Other  sources  contain  tables  of  sine  (and  cosine)  transforms  and  their  inverses, 
similar  to  those  for  Laplace  transforms.  We  refer  readers  to  Churchill  (1969, 
1972)  or  Farlow  (1993),  for  example.  □ 


EXERCISES 


1.  Is  the  partial  differential  equation 


(x2uxJ  +  x2uu  =  o 


separable?  Is 


separable? 


uXx  +  (x  +  y)  u 


2 -  -  0 


yy 


2.  Use  an  energy  argument  to  prove  that  the  SLP 


e  xy  j  =  Xe  xy ,  0  <  x  <  1,  y( 0)  =  y(  1)  =  0 

has  only  positive  eigenvalues. 

3.  Consider  the  SLP 

xy')  =  xy'  l<x <h>  y w  =  y^  = 

Find  the  eigenvalues  and  eigenfunctions,  and  write  down  the  Fourier  expan¬ 
sion  for  a  given  function  f(x)  on  the  interval. 

4.  Consider  the  SLP 

-  (ary)  =  Xy,  1  <  x  <  tt,  y(  1)  =  y(i r)  =  0. 

Use  an  energy  argument  to  show  that  any  eigenvalue  must  be  nonnegative. 
Find  the  eigenvalues  and  eigenfunctions. 
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5.  Show  that  the  differential  operator 

Ly  =  a(x)y"  +  b(x)y'  +  c(x)y 

can  be  written 

a(x) 
x) 

and  therefore  the  eigenvalue  problem  Ly  =  A y  can  be  written 

p(x)y'^j  +  q(x)y  =  A  r(x)y 

for  appropriately  defined  functions  functions  p,  <7,  and  r. 

6.  Let  L  be  the  differential  operator  defined  by  Ly  =  a(x)y"  +  b(x)yr  + 
c(x)y,  0  <  x  <  1.  Find  an  operator  L*  for  which 

(yiXy?)  =  (y2,L*yi)  +B(x)  |J, 

where  B(pc)  |  J  represents  boundary  terms.  The  operator  L*  is  called  the  for¬ 
mal  adjoint  of  L.  Hint:  Write  out  the  inner  product  (pi,  L7/2)  and  integrate 
by  parts  twice  to  remove  the  derivatives  on  y2  and  put  them  on  y\. 

7.  Consider  the  initial  BVP  for  the  wave  equation  on  0  <  x  <  l  with  variable 
sound  speed  c(x): 

Utt  =  c(x)2uXXJ  0  <  x  <  l,  t>  0, 
u(0,t)  =  u(l,t)  =  0,  t  >  0, 
u(x,  0)  =  /(x),  Ut(x,  0)  =0,  0  <  x  <  l. 

Find  the  Sturm-Liouville  problem  associated  this  BVP.  What  is  the  weight 
function  in  the  orthogonality  relation? 

8.  (Parameter  identification)  This  problem  deals  with  determining  the  unknown 
density  of  a  nonhomogeneous  vibrating  string  by  observing  one  funda¬ 
mental  frequency  and  the  corresponding  mode  of  vibration.  Consider  a 
stretched  string  of  unit  length  and  unit  tension  that  is  fastened  at  both 
ends.  The  displacement  is  governed  by  (see  Section  1.5) 

p(x)utt  =  uxx ,  0  <  x  <  1,  t  >  0,  (4.50) 

7i(0,  t)  =  u(l,  t)  =  0,  t  >  0,  (4.51) 

where  p(x)  is  the  unknown  density.  If  we  assume  a  solution  of  the  form 
u(x,t)  =  y(x)g(t),  then  we  obtain  a  Sturm-Liouville  problem 


p{x)y' 


'  c(x)p(x ) 

+  i:  y 


a(pc) 


y"  =  A p(x)y,  0  <  x  <  1;  y{ 0)  =  y{  1)  =  0. 
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Suppose  we  observe  (say,  using  a  strobe  light)  a  fundamental  frequency 
A  =  Af  and  the  associated  mode  (eigenfunction)  y  =  yf(x).  Show  that  the 
unknown  density  must  satisfy  the  integral  equation 

[  (1  —  x)yf(x)p(x)dx  =  ^  . 

Jo 

Hint:  Integrate  the  ODE  from  x  =  0  to  x  =  s  and  then  from  s  =  0  to  s  =  1. 
Determine  the  density  if  it  is  a  constant. 


9.  (Parameter  identification)  Consider  heat  flow  in  a  rod  of  length  n  and  unit 
diffusivity  whose  ends  are  held  at  constant  zero  temperature  and  whose 
initial  temperature  is  zero  degrees.  Suppose  further  that  there  is  an  exter¬ 
nal  heat  source  f(x)  supplying  heat  to  the  bar.  Address  the  question  of 
recovering  the  heat  source  f(x)  from  a  temporal  measurement  of  the  tem¬ 
perature  U (t)  made  at  the  midpoint  of  the  rod.  Proceed  by  formulating 
the  appropriate  equations  and  show  that  f(x)  must  be  the  solution  to  the 
integral  equation 


-  C/(0  <%  dr, 


where 


oo 


9&t)  =  ~Y1 

7T  z ' 


—  n2t  •  •  >. 

e  sin  —  sinnf 


n— 1 


Show  that  for  any  positive  integer  m,  the  pair 


u(x,  t) 


=  rn  3^2 


1 


—  m2t 


sm  mx 


f(x ) 


m  sm  mx 


satisfies  the  model  equations  you  proposed.  Show  that  for  m  large,  u(x,  t)  is 
(uniformly)  small,  yet  max|/(x)|  is  large,  and  thus  state  why  small  errors 
in  the  measurement  U (t)  may  lead  to  large  differences  in  f(x).  (This  means 
that  recovery  of  /  by  solving  the  integral  equation  above  may  be  unstable 
and  therefore  difficult.). 


4.4  Laplace’s  Equation 

In  Section  1.7  we  introduced  Laplace’s  equation,  which  is  given  in  two  and 
three  dimensions  by 
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respectively.  Applications  of  Laplace’s  equation  are  far-reaching,  from  steady- 
state  heat  flow,  electrostatics,  potential  flow  of  a  fluid,  deflections  of  a  mem¬ 
brane,  to  complex  analysis.  Solutions  of  Laplace’s  equation  are  called  har¬ 
monic,  or  potential  functions,  referring,  for  example,  to  the  potential  of  an 
electric  field.  Laplace’s  equation  itself  is  sometimes  called  the  potential  equa¬ 
tion. 

The  separation  of  variables  method  can  easily  be  applied  to  solve  Laplace’s 
equation  on  rectangles  in  two  dimensions,  just  exactly  as  the  heat  equation 
and  wave  equation  were  solved.  Exercises  at  the  end  of  this  section  provide 
examples.  In  Chapter  2,  using  Fourier  transforms,  we  solved  Laplace’s  equation 
in  a  half  space  y  >  0.  Presently,  we  extend  the  eigenfunctions  method  to  circular 
domains,  wedges,  and  annuli.  Later,  we  develop  some  general  properties  of 
harmonic  functions  in  both  two  and  three  dimensions. 

Temperatures  in  a  Disk 

Suppose  we  know  the  temperature  on  the  boundary  of  a  circular,  laminar 
disk  of  radius  R.  The  goal  is  to  find  the  equilibrium  temperatures  inside  the 
disk.  Separation  of  variables  works  in  a  straightforward  way  and  leads  to  a 
particularly  nice  integral  formula  for  the  solution. 

Because  the  region  of  interest  is  a  disk,  we  guess  that  polar  coordinates  are 
more  appropriate  than  rectangular  coordinates,  so  we  formulate  the  steady- 
state  heat  flow  problem  in  polar  coordinates  r  and  0,  where 

x  =  r  cos  0,  y  =  r  sin  0. 

Then  a  circular  disk  of  radius  R  can  be  represented  simply  as  0  <  r  <  77, 
0  <  0  <  27 r.  The  unknown  temperature  inside  the  disk  is  u  =  u(r,  0),  and  the 
prescribed  temperature  on  the  boundary  of  the  plate  is  u(R,0)  =  /(0),  where 
/  is  a  given  function.  We  know  from  prior  discussions  (Section  1.8)  that  u  must 
satisfy  Laplace’s  equation  A u  =  0  inside  the  disk.  Therefore,  upon  representing 
the  Laplacian  A  in  polar  coordinates  (see  Section  1.8),  we  are  faced  with  the 
following  boundary  value  problem  for  u(r,  6): 


1  1 

urr  H — ur  H — ^uee  =  0,  0  <  r  <  77,  0  <  0  <  27t, 

ry» 

(4.52) 

u(R,  0)  =  f(0), 

0  <  9  <  2tt. 

(4.53) 

Implicit  are  periodic  boundary  conditions 

o 

II 

to 

£ 

<as 

,  0)  =  uo(r,2  7r). 

(4.54) 

The  separation  of  variables  method  works  the  same  way  as  in  rectangular  coor¬ 
dinates.  We  assume  a  product  solution  u(r,0)  =  y(r)g(0)  and  substitute  into 
the  PDE  to  obtain 
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y"(r)g(6)  +  -y'(r)g(6)  +  \y{r)g"(6)  =  0. 

ry»  ry* 

This  can  be  written 

r2y"(r)  +  ry'(r)  g"(0) 

W)  =  TT  =  “  ’ 

where  —A  is  the  separation  constant.  Therefore,  we  obtain  two  ordinary  differ¬ 
ential  equations  for  y  and  g ,  namely, 

rV'(r)  +  ry'(r)  =  A y(r),  g”(0)  =  -A g{9). 

The  periodic  boundary  conditions  force  g{ 0)  =  g{ 2tt)  and  g'(0)  =  g'( 2tt).  There¬ 
fore,  we  have  the  following  self-adjoint  SLP  problem  for  g(9): 

g'\e)  =  -A g(6);  g( 0)  =  g( 2tt),  ^(0)  =  g’{ 2tt).  (4.55) 

First,  it  is  clear  that  A  =  0  is  an  eigenvalue  with  corresponding  eigenfunc¬ 
tion  go  (6)  =  1-  Moreover,  there  are  no  negative  eigenvalues;  if  A  is  negative, 
then  the  ODE  has  exponential  solutions,  and  exponential  solutions  cannot  sat¬ 
isfy  periodicity  conditions.  (Or,  one  could  apply  an  energy  argument  to  show 
nonnegativity.)  Therefore,  let  us  assume  A  =  p2  >  0.  The  differential  equation 
for  g  has  general  solution 


g{0)  =  a  cos  pO  +  b  sin  pO. 

The  periodic  boundary  conditions  force 

(cos  27 rp  —  l)a  +  (sin  2irp)b  =  0, 

(sin27rp)a  +  (1  —  cos27t  p)b  =  0. 

This  is  a  system  of  two  linear  homogeneous  algebraic  equations  for  the  constants 
a  and  b.  From  matrix  algebra,  we  know  that  it  has  a  nontrivial  solution  if  the 
determinant  of  the  coefficients  is  zero,  i.e., 

(cos27rp  —  1)(1  —  cos27rp)  —  sin2  2np  =  0, 


or,  simplifying, 


This  means  that  p 


cos  2irp  =  1. 

yf\  must  be  a  positive  integer,  that  is, 


r\ 

A  —  A fi  —  ^7  r?'  —  1,2,.... 
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Along  with  Ao  =  0,  these  are  the  eigenvalues  of  the  problem  (4.55).  The  eigen¬ 
functions  are 


go(9)  =  1,  gn  ($)  =  an  cos  tiO  T  bn  sin  tiO  ,  n  —  1,2,.... 

Next  we  solve  the  ^/-equation.  Of  course,  we  want  bounded  solutions.  For 
A  =  0  the  only  bounded  solution  is  yo(r)  =  1  (the  other  independent  solution 
in  this  case  is  lnr,  which  is  unbounded).  For  A  =  n2  the  equation  is 

r2y"  (r )  +  ry'(r)  —  n2y{r)  =  0, 

which  is  a  Cauchy-Euler  equation  (see  the  Appendix)  with  general  solution 

Vn(r)  =  cnr~n  +  dnrn .  (4.56) 


We  can  set  cn  =  0  because  we  want  bounded  solutions  on  0  <  r  <  R.  Thus, 
setting  dn  =  1  for  all  n,  we  have 

yn{r)  =  rn,  n  =  1,2, ... . 


In  summary,  we  constructed  solutions  of  the  given  boundary  value  prob¬ 
lem  of  the  form  uo(r,Q)  =  constant  =  ao/2,  u  =  un(r,6 )  =  rn(an  cos  nS  + 
bn  sin  n0),  n  =  1,2,....  To  satisfy  the  boundary  condition  at  r  =  A*  we  form 
the  linear  combination 

oo 

u{r ,  0)  =  —  +  rn(an  cos  nO  +  bn  sinnO).  (4.57) 

n=l 

The  boundary  condition  r^(J7,  6)  =  f(0)  then  yields 


ao 


oo 


+  Tr" 


a 


n 


cos  nO  +  bn  sin  nO 


n— 1 


which  is  the  full-range  Fourier  series  for  /(0)  (see  Section  3.3).  Therefore,  the 
coefficients  are  given  by 


a 


n 


1 


•27T 


7 xRn 


f{6)  cos  nO  dO , 


o 


1 


•27T 


7ri^n 


f(6)  sin  nO  d6 , 


o 


n  =  0, 1,  2, . . . , 
n  =  1,2,.... 


(4.58) 

(4.59) 


Hence,  the  solution  to  the  BVP  (4.52-4.54)  is  given  by  (4.57)  with  the  coeffi¬ 
cients  given  by  (4.58-4.59). 


As  it  turns  out,  this  infinite  series  solution  can  be  cleverly  manipulated  to 
obtain  a  simple  formula,  called  Poisson’s  integral  formula,  for  the  solution.  Let 
us  substitute  the  coefficients  given  in  formulas  (4.58)  and  (4.59)  (after  changing 
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the  dummy  integration  variable  from  9  to  </>)  into  the  solution  formula  (4.57) 
to  obtain 

•27 r 


u(r,  6) 


1 


27 r 


/O)  d<f> 


0 


oo 


+  E 


r 


n  /*27t 


n=l 


nRn 


f(q *))  (cos  n<j)  cos  +  sin  n</>  sin  n0)d(j) 


o 


1 


27T 


/(</>)  (l  +  2  E  (^)  cos  n(0  -0)J 


The  infinite  sum  in  the  integrand  can  be  determined  exactly  as  follows.  Recall¬ 


ing  the  identity  cos  a  =  |(em  +  e  m),  we  can  write 


oo 


oo 


oo 


■in(0  —  4>) 


1  +  !E(s)  c°8  »(«-<«  =  1  +  E  ©  c«*-«  +  E(s 

n=l  n=l  n=l 

But  each  series  on  the  right  side  is  a  geometric  series,  and  we  know  from  calculus 
that  _ 

oo 

Z 


E 

n—  1 


= 


1-Z 


,  provided  that  z  <  1 


Therefore  we  get 


oo 

1  +  E(^)  cosn(6>  -  (f>)  =  1  + 

n—  1 


re 


+ 


re 


R  —  0)  R  —  4^  0) 

R2  -  r2 


R2  _|_  r2  _  2rRcos(6  —  </>) 
Assembling  the  results,  we  have  the  following. 


"heorem  4.19 


(Poisson’s  integral  formula)  If  /  is  a  continuous  function,  then  the  BVP  for 
Laplace’s  equation  (4.52-4.54)  on  a  circular  domain  of  radius  R  is  given  by 


u(r,  6)  = 


1 


•27 r 


R2-r2) /(</>) 


27t  J o  R2  r2  —  2rRcos(6  —  </>) 


dcj).  □ 


(4.60) 


Remark  4.20 


Poisson’s  integral  formula  is  unwieldy  for  calculations.  But  it  is  an  important 
theoretical  tool.  For  example,  if  we  use  the  formula  to  find  the  temperature  at 
the  origin,  we  immediately  get 


1 

27T 


f{4>)d<t> ■ 


u{  0, 0) 
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In  other  words,  the  temperature  at  the  center  of  the  disk  is  the  average  of  the 
prescribed  boundary  temperatures.  □ 


Remark  4.21 

(Exterior  Dirichlet  Problem)  By  the  same  method,  we  can  show  that  the 
bounded  solution  to  the  exterior  Dirichlet  problem 

1  1 

urr  H — ur  H — ~uqo  =  0,  r  >  R,  0  <  0  <  27r, 

ry*  ry»  A 

u(R ,  0)  =  /(0),  0  <  6  <  2tt, 


where  /  is  continuous,  is 


u(r ,  6)  = 


1 


•  27 r 


(r2  -  R2 )/(</>) 


27t  J q  R?  +  r2  —  2rRcos(6  —  0) 


d<f>.  □ 


Solving  Laplace’s  equation  on  portions  of  circles  (wedges)  and  for  annular 
domains  is  similar.  Readers  familiar  with  complex  analysis  may  recall  that  some 
unusual-shaped  domains  can  be  mapped  conformally  onto  a  simple  domain 
where  Laplace’s  equation  can  be  solved;  conformal  transformations  preserve 
solutions  to  Laplace’s  equation,  and  so  this  method  can  result  in  a  closed  form 
solution,  i.e.,  a  formula,  for  the  answer.  Many  elementary  texts  on  complex 
analysis  discuss  this  method  (see,  for  example,  Churchill  (1960)  for  an  elemen¬ 
tary  discussion). 

As  an  aside,  the  most  common  method  in  practice  for  solving  Laplace’s 
equation  on  any  bounded  domain,  regardless  of  its  shape,  is  to  use  a  numerical 
method.  For  example,  in  the  method  of  finite  differences  (see  Exercise  6  in 
Section  1.8)  the  derivatives  are  approximated  by  difference  quotients,  and  the 
solution  u(x,  y )  is  determined  at  discrete  lattice  points  inside  the  region.  This 
method  is  further  discussed  in  Chapter  6.  Another  widely  used  method  not 
discussed  here  is  the  finite  element  method ,  which  also  yields  the  approximate 
solution  at  discrete  points  of  the  domain. 

General  Results  for  Laplace’ s  Equation 

In  the  next  few  paragraphs  we  introduce  some  basic  properties  of  solu¬ 
tions  to  Laplace’s  equation  on  bounded  spatial  domains  in  three  dimensions. 
The  results  also  apply  to  two  dimensions.  Many  of  the  assertions  follow 
from  the  divergence  theorem,  which  states  that  for  a  smooth  vector  field 
4>  =  <p(x,y,z ), 


div  0  dV  =  /  0  •  n  dA. 


n 


on 


(4.61) 
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The  divergence  theorem  is  a  version  of  the  fundamental  theorem  of  calculus  in 
three  (or  two)  dimensions — it  converts  the  integral  of  derivatives,  a  divergence, 
to  an  integral  over  the  boundary.  We  know  from  elementary  calculus  that  the 
divergence  measures  the  local  outflow  per  unit  volume  of  a  vector  field  0, 
so  the  divergence  theorem  states  that  the  net  outflow,  per  unit  volume,  in 
a  region  ft  must  equal  the  flux  through  the  boundary  (surface)  dft  of  that 
region.  Recall  that  the  integral  on  the  right  is  a  flux  integral  through  the 
surface  oriented  by  the  outward  unit  normal  vector  n.  Here  we  are  taking  a 
fluid  flow  interpretation  in  speaking  of  outflow,  but  the  interpretation  is  valid 
for  any  smooth  vector  field.  We  always  take  ft  to  be  a  nice  region  with  a  smooth 
boundary  or  with  a  boundary  made  up  of  finitely  many  smooth  sections.  There 
are  some  pathological  domains  on  which  the  divergence  theorem  does  not  apply, 
but  we  do  not  consider  those.  Moreover,  we  always  assume  that  the  functions 
involved  in  our  discussion  are  sufficiently  smooth  for  the  theorems  to  apply. 
For  example,  the  functions  should  have  two  continuous  partial  derivatives  in 
17,  be  continuous  on  ft  U  di?,  and  have  first  partial  derivatives  that  extend 
continuously  to  the  boundary. 

Two  important  integral  identities,  called  Green’s  identities,  follow  from  the 
divergence  theorem.  These  identities  are  higher  dimensional  versions  of  Green’s 
identity  for  Sturm-Liouville  problems  obtained  in  Section  4.2.  To  begin,  note 
that  if  u  is  a  scalar  field  and  0  is  a  vector  field,  then 


div  (u<fi)  =  u  div  0  +  0  •  grad  u 

(see  Exercise  4,  Section  1.7).  Integrating  over  the  volume  ft  and  using  the 
divergence  theorem  gives 

u  0  •  n  dA  =  u  div  0  dV  +  /  0  •  grad  u  dV. 

df2  J  ft  J  Q 

Setting  0  =  grad  v  for  a  scalar  function  v  then  gives 

u  grad  v  •  n  dA  =  /  u  Av  dV  /  grad  v  •  grad  u  dV.  (4.62) 

dQ  J  Q  J  i 1 

Here  we  have  used  the  fact  that  div  (grad  u)  =  A u,  the  Laplacian.  In  particular, 
if  we  set  r  =  w,  we  obtain 


ugra,du-ndA  =  /  uAudV  A  /  grad  u  •  grad  u  dV,  (4.63) 

df2  J f2  J Q 

and  this  is  Green’s  first  identity.  Note  that  in  one  dimension  this  identity 
is  the  same  as 


yy"  dx 


yy 


a 


a 


y ')  dx, 


which  is  just  the  integration  by  part  formula. 
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To  obtain  Green’s  second  identity,  interchange  v  and  u  in  equation  (4.62) 
and  subtract  that  result  from  (4.62)  to  get 


uAv  dV  = 


vAu  dV  + 


/  (u  grad  v  —  v  div  u) 
Jon 


n  dA.  (4.64) 


This  is  Green’s  second  identity;  it  can  be  regarded  as  an  integration  by 
parts  formula  for  the  Laplacian  A;  note  how  the  Laplacian  is  taken  off  v  and 
put  on  u  and  a  boundary  term  is  produced. 


Notation.  We  often  use  the  notation 

du 


—  =  n  •  grad  u 
dn 


to  denote  the  normal  derivative ,  i.e.,  the  derivative  of  u  on  a  boundary  in  the 
direction  of  the  outward  unit  normal.  □ 

Green’s  identities  can  be  used  to  prove  many  interesting  facts  about  prob¬ 
lems  involving  Laplace’s  equation.  To  begin,  we  show  that  solutions  to  the 
Dirichlet  problem  are  unique. 


"heorem  4.22 


A  solution  to  the  Dirichlet  problem 


A u  =  0  in  17, 
u  =  f(x,y,z)  on  517, 


is  unique.  □ 

By  way  of  contradiction,  assume  that  there  are  two  solutions,  u  and  v.  Then 
the  difference  w  =  u  —  v  must  satisfy  the  problem 

Aw  =  0  in  17, 
w  =  0  on  <9 17. 

Multiply  this  PDE  by  w  and  integrate  over  the  region  17  to  get 

wAw  dV  =  0. 


Green’s  first  identity  (4.63)  implies 


w  grad  w  •  n  dA  —  /  grad  w  •  grad  w  dV  =  0. 
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The  first  integral  on  the  left  is  zero  because  w  =  0  on  the  boundary.  Therefore, 


/  grad  w  •  grad  w  dV  =  0. 

Jo 

Because  the  integrand  is  never  negative,  it  follows  that  grad  w  =  0,  or  w  = 
constant  in  17.  Because  w  =  0  on  the  boundary,  the  constant  must  be  zero, 
and  so  w  =  0  in  17  U  917.  This  means  that  u  =  v  in  17  U  917.  So  there  cannot 
be  two  solutions.  The  reader  should  recognize  the  proof  above  as  an  “energy 
argument”  of  the  type  introduced  in  Section  4.3  for  SLPs.  □ 

Another  interesting  property  of  harmonic  functions  satisfying  Dirichlet-type 
boundary  conditions  is  that  they  minimize  the  energy  integral 


E(w) 


grad  w  •  grad  w  dV. 


In  electrostatics,  for  example,  where  w  is  the  electric  field  potential,  the  energy 
E  is  the  energy  stored  in  the  electrostatic  field. 


Theorem  4.23 

(Dirichlet’s  principle)  Suppose  u  satisfies 

A u  =  0  in  17;  u  —  f  on  917. 

Then  E(u)  <  E(w)  for  all  w  satisfying  w  =  /  on  df2.  In  other  words,  of 
all  functions  that  satisfy  the  boundary  condition,  the  solution  to  Laplace’s 
equation  is  the  one  that  minimizes  the  energy.  □ 

To  prove  Dirichlet’s  principle,  let  w  =  u  +  v  where  v  =  0  on  the  boundary  917. 
Then 


E(w)  =  E(u  +  v) 


/  grad  (u  +  v)  •  grad  (u  +  v)  dV 

J  Q 

grad  u  •  grad  u  dV  +  2  /  grad  u  •  grad  v  dV 

Q  J  £2 

+  /  grad  v  •  grad  v  dV 

J n 

E(u )  +  E(v)  +  2  /  grad  u  •  grad  v  dV. 

Jf2 


But  the  integral  term  on  the  right  is  zero  by  equation  (4.62).  And  because 
E{v)  >  0,  we  have  E(u)  <  E(w ),  which  completes  the  proof.  □ 
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EXERCISES 

1.  Consider  the  pure  boundary  value  problem  for  Laplace’s  equation  given  by 

xx  "b  'Llyy  —  0?  0  X  <C  0  7/  <d  1 9 

u{ 0,  y )  =  0,  u(l,  y)  =  0,  0  <  y  <  1, 

0)  =  0,  u(x,  1)  =  G(x),  0  <  x  <  l. 

Use  the  separation  of  variables  method  to  find  an  infinite  series  representa¬ 
tion  of  the  solution.  Here,  take  u(x,  y )  =  (j)(x)fip(y)  and  identify  a  boundary 
value  problem  for  4>(x);  proceed  as  in  other  separation  of  variables  prob¬ 
lems. 

2.  Find  an  infinite  series  representation  for  the  solution  to  the  equilibrium 
problem 

uxx  +  uyy  =  0,  0  <  x  <  a,  0  <  y  <  6, 

Ux( 0,  y)  =  ux(a,  y)  =  0,  0  <  y  <  b, 

u(x,  0)  =  f(x),  u(x,b)  =  0,  0  <  x  <  a. 

Interpret  this  problem  in  the  context  of  steady  heat  flow. 

3.  Solve  the  boundary  value  problem 

A u  =  0,  r  <  R,  0  <  0  <  27t, 
u(R ,  0)  =  4  +  3  sin  0,  0  <  6  <  2n. 

4.  Show  that  the  solution  u  =  u(r ,  0)  to  the  exterior  boundary  value  problem 

A u  =  0,  r  >  R,  0  <  0  <  27r, 
u(R ,  0)  =  /(0),  0  <  0  <  27t, 

is  given  by 

1  ,2,  (ri-RAfU) 

U(r’e)  =  ^j„  m  *  r> *)**■ 

Hint:  Follow  the  example  in  the  book  but  choose  the  other  bounded  solution 
in  equation  (4.56). 


A u  =  0,  r  >  1,  0  <  0  <  27t, 
tz(1,0)  =  cos0,  0  <9  <  2tt. 


5.  Solve 
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6.  Find  an  infinite  series  solution  to  the  following  boundary  value  problem  on 
a  wedge: 

A u  =  0,  r  <  R,  0  <  0  <  7t/2, 
u(R ,  6)  =  /(0),  0  <  6  <  7r/2, 
u(r,  0)  =  0,  u(r ,  tt/2)  =0,  0  <  r  <  R. 

7.  Solve  the  boundary  value  problem  in  Exercise  5  with  the  boundary  condi¬ 
tions  along  the  edges  9  =  0,  6  =  tt/2  replaced  by 

u(r,  0)  =  0,  ue(r ,  tt/2)  =0,  0  <  r  <  i?. 

8.  Solve  the  Dirichlet  problem  on  an  annulus: 

A  u  =  0,  l<r<2,  0  <  0  <  27t, 
u(  1,  9)  =  sin  0,  0  <  9  <  2tt, 
u( 2,  0)  =  cos  0,  0  <  0  <  2tt. 

9.  Find  a  bounded  solution  to  the  exterior  boundary  value  problem 

A u  =  0,  r  >  R, 

u  —  1  +  2  sin  9  on  r  =  R. 

10.  Prove  Dirichlet’s  principle  for  the  Neumann  problem:  Let 

E(w)  =  -  f  grad  •  grad  w  dV  —  I  hw  dA. 

2  Jf2  Jdf2 

If  u  is  the  solution  to  the  BVP 

A u  =  0  in  !?, 
n  •  grad  u  =  h  on 

then  E(u)  <  E(w)  for  all  w  sufficiently  smooth.  Observe  that  the  average 
value  of  h  on  the  boundary  is  zero. 

11.  Show  that  a  necessary  condition  for  the  Neuman  problem 

A u  =  /  in  !?, 
n  •  gradi/  =  h  on 

to  have  a  solution  is  that 

f  fdV  =  f  gdA. 

JQ  Jdf2 

Interpret  this  result  in  the  context  of  heat  conduction. 
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12.  Find  radial  solutions  u  =  u(p)  of  the  equation 

A  u  =  k2u 


in  spherical  coordinates.  Hint:  Let  u  =  pw. 

13.  In  three  dimensions  consider  the  radiation  boundary  value  problem 

A u  —  cu  =  0  in  17, 
n  •  grad  u  +  au  =  0  on  c?!7, 

where  a,  c  >  0  and  17  is  a  bounded  region.  Show  that  the  only  solution  is 
u  =  0.  Hint:  Use  an  energy  argument.  Multiply  the  PDE  by  u ,  integrate, 
and  then  use  Green’s  first  identity. 

14.  Use  the  last  exercise  to  show  that  solutions  to  the  boundary  value  problem 

A u  —  cu  =  g(x,  y,  z)  (x,  y ,  z)  G  17, 
n  •  grad  u  +  au  =  /(#,  y ,  z)  on  917, 


are  unique. 

15.  Suppose  u  =  u(x,  2/,  z)  satisfies  the  Neumann  problem 

A^  =  0,  in  17, 
n  •  grad  u  =  0  on  9 17. 

Show  that  u  must  be  constant  on  17. 


16.  Suppose  u  =  u(x,  7/,  z)  satisfies  the  Robin  problem 


du 

dn 


Au  =  0  in  17, 

+  au  =  0  on  dft. 


Show  that  solutions  are  unique  if  a  >  0. 

17.  Show  that  the  Poisson  integral  formula  can  be  written  in  vector  form  as 


u(x.) 


Ft 


x 


u{y) 


27 tR 


|y| =R 


x-y 


where  the  integral  is  a  line  integral  with  respect  to  arclength  over  the  circle 
=  R. 

18.  Show  that  the  Neumann  problem  on  the  unit  disk,  Au  =  0  in  r  <  1, 
u  =  sin2  6  on  r  =  1,  does  not  have  a  solution. 


19.  Consider  the  Dirichlet  problem  between  two  spheres:  Au  =  0  in  Ri  <  p  < 
R2 ,  and  u  =  a  on  p  =  i?i,  u  =  b  on  p  =  R2.  Find  the  solution  and  sketch 
graphs  of  u  vs.  p  in  the  cases  a  <  b  and  b  <  a. 


198 


Bounded  Domains 


4.5  Cooling  of  a  Sphere 

In  this  section  we  present  the  solution  to  a  classical  problem  in  three  dimensions, 
the  cooling  of  a  sphere.  The  symmetries  in  the  problem  permits  us  to  reduce 
the  dimension  of  the  problem  to  one  spatial  dimension  (the  radius)  and  time, 
and  we  can  follow  the  same  procedures  as  in  previous  sections. 

The  problem  is  this:  Given  a  sphere  whose  initial  temperature  depends 
only  on  the  distance  from  the  center  (e.g.,  a  constant  initial  temperature)  and 
whose  boundary  is  kept  at  a  constant  temperature,  predict  the  temperature  at 
any  point  inside  the  sphere  at  a  later  time.  This  is  the  problem,  for  example,  of 
determining  the  temperature  in  a  spherically-shaped  object,  e.g.,  a  potato,  that 
has  been  placed  in  a  hot  oven.  The  reader  could  conjecture  that  this  problem 
is  important  for  medical  examiners  who  want  to  determine  the  time  of  an 
individual’s  death  by  measuring  temperatures  in  brain  tissue.  Early  researchers, 
notably  Kelvin,  used  this  problem  to  determine  the  age  of  the  earth  based  on 
conjectures  about  its  initial  temperature  and  its  temperature  today. 

This  problem  is  reminiscent  of  Newton’s  law  of  cooling,  which  is  encountered 
in  ordinary  differential  equations  texts.  It  states  that  the  rate  at  which  a  body 
cools  is  proportional  to  the  difference  of  its  temperature  and  the  temperature 
of  the  environment.  Quantitatively,  if  T  =  T(t)  is  the  temperature  of  a  body 
and  Te  is  the  temperature  of  its  environment,  then  T'(t )  =  —  h(T  —  Te),  where 
h  is  the  constant  heat  loss  coefficient.  But  the  reader  should  note  that  this  law 
applies  only  in  the  case  that  the  body  has  a  uniform ,  homogeneous  temperature. 
In  the  PDE  problem  we  are  considering,  the  temperature  may  vary  radially 
throughout  the  body. 

For  simplicity,  we  consider  a  sphere  of  radius  p  =  tt  whose  initial  tempera¬ 
ture  is  Tq  =  constant.  We  assume  the  boundary  is  maintained  at  zero  degrees 
for  all  time  t  >  0.  In  general,  the  temperature  u  inside  the  sphere  depends  on 
three  spherical  coordinates,  p,  0,  (/>,  and  time.  But  reflection  reveals  that  the 
temperature  will  depend  only  on  the  distance  p  from  the  center  of  the  sphere 
and  on  time;  that  is,  u  =  u(p,  t).  There  are  no  variations  in  the  initial  or  bound¬ 
ary  conditions  that  would  cause  gradients  to  change  in  the  0  and  coordinate 
directions.  Evidently,  the  temperature  must  satisfy  the  heat  equation 

Ut  =  kAu , 

where  k  is  the  diffusivity  and  A  is  the  Laplacian  (see  Section  1.7).  Because  u 
does  not  depend  on  <fi  and  0,  the  Laplacian  takes  on  a  particularly  simple  form: 
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(see  Section  1.8).  Therefore,  we  can  formulate  the  model  as 

(  2  \ 

ut  =  k  I  upp  H — up  j  ,  0  <  p  <  7r,  t  >  0, 

u(tt,  t)  =  0,  t  >  0, 

u(p,  0)  =  To,  0  <  P  <  7T. 


(4.65) 

(4.66) 

(4.67) 


There  is  an  implied  boundary  condition  at  p  =  0,  namely  that  the  temperature 
should  remain  bounded. 

To  solve  problem  (4.65-4.67)  we  assume  u(p,t)  =  y(p)g(t).  Substituting 
into  the  PDE  and  separating  variables  gives,  in  the  usual  way, 

g'(t)  y"(p )  +  h'(p) 

kg(t )  “  y{p) 

where  —A  is  the  separation  constant.  Substituting  u(p,t)  =  y(p)g(t)  into  the 
boundary  condition  (4.66)  gives  y( tt)  =  0,  and  the  boundedness  of  u  at  p  =  0 
forces  7/(0)  to  be  bounded.  Therefore,  we  get  the  boundary  value  problem 

-  y"(p)  -  - y\p )  =  Ay(p),  o  <  p  <  tt,  (4.68) 

p 

2/(0)  bounded,  y(7r)  =  0.  (4.69) 

This  is  a  singular  SLP. 

If  A  =  0,  then  the  equation  is  Cauchy-Euler,  having  the  form  —y"  —  ^y'  =  0. 
The  general  solution  is  y  =  a/ p  +  b.  The  boundedness  condition  at  zero  forces 
a  =  0,  and  y(jr)  =  0  implies  6  =  0.  Therefore  A  =  0  is  not  an  eigenvalue.  If 
A  7^  0,  it  may  not  be  clear  to  the  reader  how  to  solve  the  variable-coefficient 
equation  (4.68).  However,  if  we  introduce  the  new  dependent  variable  Y  =  Y(p) 
defined  by 

Y{p)  =  py{p), 

then  (4.68)  transforms  into  the  familiar  equation 

-Y"(p)  =  A  Y(p). 

If  A  >  0,  then  Y  =  a  cos  \f\p  +  b  sin  yf\ p,  which  gives 

y(p)  =  -  (a  cos  a/A p  +  b  sin  V\p^j . 

We  must  have  a  =  0  because  cosVXp/p  is  unbounded  at  p  =  0.  Therefore, 
applying  the  boundary  condition  y( tt)  =  0  then  gives 

b  sin  V\tt  =  0, 
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which  in  turn  yields  the  eigenvalues 


2 

A  —  A  fi  —  n  ,  n  —  1,2,.... 


The  eigenfunctions  are 


Vn(p) 


sin  np 

7 

P 


n  =  1,2, 


We  leave  it  as  an  exercise  to  show  that  there  are  no  negative  eigenvalues  (see 
Exercise  1). 

The  corresponding  solutions  to  the  time  equation  for  g(t)  are  easily  found 
to  be  gn(t )  =  cne~n  kt .  Thus  we  have  determined  infinitely  many  solutions  of 
the  form 

/  x  _ ^2 1.  sin  Tip  _ 

un(p,t)  =  cne  - ,  n  =  1,2,..., 

P 

that  satisfy  the  PDE  and  the  boundary  conditions.  To  satisfy  the  initial  con¬ 
dition  (4.67)  we  form  the  linear  combination 


oo 


U 


*)  =  E 


cne 


n=l 


-u2kt®nnp 

P 


Applying  the  initial  condition  gives 


oo 


U 


(P,  0)  =  To  =  ^2  Cn 


n— 1 


sm  np 
P 


or 


oo 


Top  =  cn  sin  np. 

n— 1 

The  right  side  of  this  equation  is  the  Fourier  sine  series  of  the  function  Top  on 
the  interval  [0,  tt]  .  Therefore,  the  Fourier  coefficients  cn  are  given  by 


2  r 

cn  —  —  /  Top  sin  np  dp. 

TT  Jo 


The  integral  can  be  calculated  by  hand  using  integration  by  parts.  We  get 


c 


n 


(-1) 


71+1  2^0 

n 


In  summary,  we  have  derived  the  solution  formula  for  the  model  (4.65-4.67): 


oo 


2Tn 


u 


(P,o  =  E(-1)n+We 

/  i 


n 


n—  1 


2fctsmnp 

P 


(4.70) 


This  formula  gives  the  temperature  at  time  f  at  a  distance  p  from  the  center 
of  the  sphere. 
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1.  Show  that  the  eigenvalue  problem  (4.68-4.69)  has  no  negative  eigenvalues. 

2.  In  the  cooling  of  the  sphere  problem  take  R  =  tt  inches,  To  =  37  degrees 
Celsius,  and  k  =  5.58  inches-squared  per  hour.  Using  (4.70),  plot  the  tem¬ 
perature  at  the  center  of  the  sphere  as  a  function  of  time;  take  t  =  0  to 
t  =  1  hour. 

3.  This  problem  deals  with  the  cooling  of  a  sphere  with  a  radiation  boundary 
condition.  A  spherical  ball  of  radius  R  has  a  diffusivity  k  and  initial  tem¬ 
perature  u  =  /(p),  0  <  p  <  R,  which  depends  only  on  the  distance  p  from 
the  center.  Heat  radiates  from  the  surface  of  the  sphere  via  the  law 


up(RR)  =  — hu(RR ), 


where  h  is  a  positive  constant  and  Rh  <  1.  Find  a  formula  for  the  temper¬ 
ature  u(p,  t )  in  the  sphere.  Hint:  Find  the  eigenvalues  Xn  as  the  positive 
roots  of  the  equation 


tan  RX 


RX 

1  —  Rh 


Next,  take  R  =  tt  inches,  k  =  5.58  inches-squared  per  hour,  /(p)  =  37 
degrees  Celsius,  and  h  =  0.1  per  inch,  and  calculate  a  four  term  approxi¬ 
mate  solution  to  the  temperature  function. 


4.  Consider  a  sphere  of  unit  radius  on  which  there  is  a  prescribed  potential  u 
depending  only  on  the  spherical  coordinate  angle  (see  Section  1.8).  This 
exercise  deals  with  using  separation  of  variables  to  find  an  expression  for 
the  bounded  potential  that  satisfies  Laplace’s  equation  inside  the  sphere. 
The  boundary  value  problem  is 


A  u  =  0,  0  <  p  <  1,  O<0<7r;  u(  1,  </>)  =  /(</>),  0  <  0  <  7T, 

where  u  =  u(p,  <fi).  Notice  that  u  will  not  depend  on  the  polar  angle  0 
because  of  the  symmetry  in  the  boundary  condition. 

(a)  Assume  u  =  R(p)Y (0)  and  derive  the  two  equations 

—  ^(1  —  x2)y'^j  =X y,  —  1  <  x  <  1;  (^p2R'^j  —  XR  =  0, 

where  x  =  cos  and  y(x)  =  T(arccosx). 

(b)  The  equation  for  y  in  part  (a)  is  Legendre’s  differential  equation ,  and 
it  has  bounded,  continuous  solutions  on  [—1,1]  when 


X  =  Xn  =  n(n  +  1),  y  =  yn{pc)  =  Pn(x),  n  =  0, 1,  2, ... , 
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where  the  Pn  are  polynomial  functions  called  the  Legendre  polynomials. 
The  Legendre  polynomials  are  orthogonal  on  [—1,1],  and  they  are  given  by 
Rodrigues’  formula 


1  dn 
n\ 2n  dxn 


Go  through  a  formal  calculation  and  derive  a  formula  for  the  solution  to 
the  given  BVP  in  the  form 


oo 

u(p,<$>)  =  y^jcnpnPn{  cos  0), 

n— 0 


where 


/(</>)  Pn  (cos  (j>)  sin  cj)  d(j>. 


(c)  Find  the  first  four  Legendre  polynomials  using  Rodrigues’  formula. 

(d)  If  /(</>)  =  sin</>,  find  an  approximate  solution. 

5.  Estimate  the  age  of  the  earth.  Use  the  data  in  Exercise  5  of  Section  2.4. 


4.6  Diffusion  in  a  Disk 

In  the  last  section  we  solved  a  diffusion  problem  in  spherical  coordinates  and 
found,  assuming  angular  symmetry,  that  it  was  a  routine  application  of  the 
separation  of  variables  method.  In  this  section  we  solve  a  diffusion  problem  in 
polar  coordinates;  we  will  find  the  task  is  not  so  routine  because  we  obtain  a 
differential  equation  (Bessel’s  equation)  that  may  be  unfamiliar  to  the  reader. 
A  computer  algebra  system  may  help  illuminate  some  of  the  steps  along  the 
way.  (The  instructor  may  find  that  this  problem  is  well  suited  for  presentation 
in  a  computer  laboratory  setting.). 

Consider  a  circular,  planar  disk  of  radius  R  whose  initial  temperature  is 
a  function  of  the  radius  alone  and  whose  boundary  is  held  at  zero  degrees. 
We  are  interested  in  how  heat  diffuses  throughout  the  disk.  Intuition  dictates 
that  the  temperature  u  in  the  disk  depends  only  on  time  and  the  distance  r 
from  the  center,  or  u  =  u(r,t).  We  can  make  this  assumption  because  there 
is  nothing  in  the  initial  or  boundary  condition  to  cause  heat  to  diffuse  in  an 
angular  direction — heat  flows  only  along  rays  emanating  from  the  origin. 

We  know  that  u  must  satisfy  the  two-dimensional  heat  equation  ut  =  kAu 
in  0  <  r  <  R,  where  k  is  the  diffusivity  and  A  is  the  Laplacian  in  polar 
coordinates,  given  in  Section  1.8.  Therefore  the  PDE  is 
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The  boundary  condition  is 


0  <  r  <  R,  t  >  0. 


u(R ,  t)  =  0,  £  >  0, 


and  the  initial  condition  is 


n(r,  0)  =  /(r),  0  <  r  <  R, 


(4.71) 


(4.72) 


(4.73) 


where  /  is  a  given  radial  temperature  distribution.  It  is  understood  that  the 
temperature  should  be  bounded  at  r  =  0.  Equations  (4.71-4.73)  define  the 
initial  boundary  value  problem  we  want  to  solve. 

Now  separate  variables.  Taking  u(r,t)  =  y(r)g(t ),  the  PDE  splits  into 

=  y"(r)  +  jyg)  =  _A 

kg{t)  y(r) 

The  equation  for  g(t)  is  solved  in  the  usual  way  to  obtain 

g(t)  =  e~Xkt . 


The  radial  equation  is 


//  1  / 

V  (r)  +  -2/  (r)  =  —Xy(r). 
r 

Upon  multiplying  by  r,  we  can  rewrite  this  equation  as 


(4.74) 


n/(r))  =  A  ry{r), 


(4.75) 


which  is  Bessel’s  equation.  Condition  (4.72)  leads  to  the  boundary  condition 


y{R)  =  0, 

and  we  impose  the  boundedness  requirement 


(4.76) 


2/(0)  bounded. 


(4.77) 


Note  that  the  ODE  (4.75),  along  with  the  boundary  conditions  (4.76-4.77),  is 
a  singular  Sturm-Liouville  problem. 

It  is  easily  seen  that  A  =  0  is  not  an  eigenvalue;  in  this  case  the  general 
solution  of  (4.75)  is  y  =  a  +  51nr;  and  b  =  0  by  boundedness  and  a  =  0  by  the 
condition  y(R)  =  0.  We  leave  it  as  an  exercise  using  an  energy  argument  to 
show  that  there  are  no  negative  eigenvalues  (Exercise  1).  Therefore  we  consider 
the  case  where  A  is  positive.  In  the  last  section  we  found  a  simple  substitution 
for  a  similar  equation  y"  +  (2 / p)yr  =  A y  that  transformed  it  to  a  familiar 


204 


Bounded  Domains 


equation;  but  Bessel’s  equation  (4.74)  is  slightly  different  (by  just  a  factor  of 
2  in  one  term),  and  a  simple  change  of  variables  does  not  work.  A  successful 
approach  involves  the  use  of  power  series  methods  for  differential  equations; 
that  is,  one  assume  a  solution  of  the  form 

oo 

y(r )  =  Y  anrn 

n— 0 

and  substitute  into  (4.75)  to  determine  the  coefficients  ao,  ai,  a2r  . . .  This  cal¬ 
culation,  which  we  do  not  perform,  ultimately  leads  to  two  linearly  independent 
solutions  of  (4.74),  denoted  in  the  literature  by  Jo(VAr)  and  Yo(y/\r);  so  the 
general  of  (4.75)  is 

y(r)  =  c\Jo(VXr)  +c2Y0(V\r) 
where  c\  and  C2  are  arbitrary  constants. 


Remark  4.24 

Maple  and  MATLAB,  for  example,  denote  these  special  functions  Jo(z)  and 
Yq(z)  by  Bessel J,  BesselY  and  besselj,  bessely,  respectively.  These  are  the 
Bessel  functions  of  order  zero.  Their  properties,  graphs,  and  values  are  cat¬ 
alogued  in  these  computer  algebra  systems  as  well  as  handbooks.  □ 


Plots  of  the  Bessel  functions  are  shown  in  Figure  4.4.  It  is  clear  that  Yo(z) 
blows  up  near  z  =  0;  in  fact,  it  can  be  shown  that  Yq(z)  behaves  like  In z 
for  small,  positive  z.  Therefore,  we  discard  the  second  term  in  the  solution 
to  maintain  boundedness;  so  we  set  C2  =  0.  Therefore,  we  have  the  bounded 
solution  to  the  differential  equation  (4.75)  as 

y(r)  =  dJo(VXr).  (4.78) 


The  Bessel  function  Jq(z)  is  a  nicely  behaved  oscillatory  function.  It  is 
similar  to  the  cosine  function,  but  its  oscillations  decay  as  z  increases.  It  has 
infinitely  many  zeros  zn,  n  =  1,2,3,...;  the  first  few  are  z\  =  2.405,  Z2  = 
5.520,  z, 3  =  8.654,  2:4  =  11.790.  Its  series  representation  can  be  found  as 


Mz)  =  Y 

n= 0 


(-1)"  _2n 

n\  22n 


One  can  also  show  that  for  large  z, 


Mz) 


•) 
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Figure  4.4  MATLAB  plots  of  the  Bessel  functions  Jo(z)  and  Yq(z)  for  0  < 
z  <  15 


which  shows  its  “decaying  cosine”  behavior. 

Now  we  apply  the  boundary  condition  (4.76)  to  (4.78),  and  we  obtain 


y(R)  =  ciJo(^)  =  0. 


Thus 


VXR  =  zn  ,  z  =  1,2,3,..., 

where  the  zn  are  the  zeros  of  Jq.  Consequently,  the  eigenvalues  are 


z, 


An  —  ,  Tt  —  1,2,3,..., 


and  the  corresponding  eigenfunctions  are 


Vn(r)  =  Jo 


znr 

R 


,  n  =  l,2,3, 


In  summary,  we  have  constructed  solutions  gn(t)yn(r)  of  the  form 

znr 


e  XnktJ0 


R 


,  n  —  1,2,3, 


that  satisfy  the  PDE  and  the  boundary  conditions. 
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To  satisfy  the  initial  condition  (4.73)  we  form  the  linear  combination 


u(r,t)  =  cne  *-*J0  (*f)  . 


The  initial  condition  forces 


(4.79) 


u(r,  0)  =  f(r)  =  cnJo  (^)  • 


To  find  the  cn  we  use  orthogonality.  This  last  equation  is  a  Fourier-Bessel 
expansion,  and  the  Bessel  functions  satisfy  the  orthogonality  condition 

/  J°  i~w)  J°  (-jF)  rdr  =  n^m-  (4-8°) 

(See  Exercise  2.)  The  factor  r  in  the  integrand  is  a  weight  function,  and  the 
Bessel  functions  Jo  ( ^j- )  are  orthogonal  with  respect  to  r.  The  orthogonality 
allows  us  to  determine  the  coefficients  in  the  standard  way:  Multiply  equa¬ 
tion  (4.80)  by  rJo  and  integrate  from  r  =  0  to  r  =  R,  interchange  the 

summation  and  the  integral,  and  then  use  the  weighted  orthogonality  relation 
to  collapse  the  infinite  sum  to  one  term.  We  obtain 


where 


Therefore,  the  coefficients  are  given  by 


Jcf  f(r)Jo  (fjf)  rdr 

\0o  (¥)  II2 


(4.81) 


In  conclusion,  the  formal  solution  to  the  initial  boundary  value  problem  (4.71- 
4.73)  is  given  by  (4.79)  with  coefficients  given  by  (4.81). 

In  the  exercises  the  reader  is  asked  to  calculate  this  solution  numerically  in 
special  cases. 

We  conclude  by  making  some  brief  remarks  about  special  functions.  The 
Bessel  functions  Jo  and  Yq  encountered  above  are  just  two  examples  of  large 
classes  of  Bessel  functions.  Problems  in  cylindrical  geometry,  for  example,  the 
vibrations  of  a  drum  head,  lead  to  such  functions.  Moreover,  other  special 
functions  (Legendre  polynomials,  Laguerre  polynomials,  and  others)  occur  in  a 
similar  way  for  problems  in  other  coordinate  systems  and  domains.  That  is,  they 
occur  as  solutions  to  eigenvalue  problems  for  ordinary  differential  equations 
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that  arise  from  the  separation  of  variables  process.  Generally,  special  functions 
arising  in  this  manner  possess  orthogonality  properties  that  give  the  expansion 
of  the  solution  in  terms  of  those  functions. 


EXERCISES 

1.  Show  that  the  eigenvalue  problem  (4.75-4.77)  has  no  negative  eigenvalues. 
Hint:  Use  an  energy  argument — multiply  the  ODE  by  y  and  integrate  from 
r  =  0  to  r  =  R;  use  integration  by  parts  and  use  the  boundedness  at  r  =  0 
to  get  the  boundary  term  to  vanish. 

2.  Derive  the  weighted  orthogonality  relation  (4.80).  Hint:  Proceed  as  in  Sec¬ 
tion  4.4  for  regular  Sturm-Liouville  problems. 

3.  For  the  initial  boundary  value  problem  (4.71-4.73)  take  R  =  1,  k  =  0.25, 
and  /(r)  =  5r3(l  —  r).  Use  a  computer  algebra  program  to  determine  a  3- 
term  approximation  to  the  solution  using  (4.79).  Sketch  radial  temperature 
profiles  u(r,t)  for  several  fixed  values  of  t. 

4.  When  R  =  1  sketch  the  first  four  eigenfunctions  Jo  (^f),  n=  1,2, 3, 4. 

5.  (Computer  algebra  project)  Consider  the  boundary  value  problem 

y"  +  axy  =  0,  0  <  x  <  L, 

t/(0)  =  0,  y(L)  =  0, 

where  a  is  a  positive  constant.  Find  the  smallest  positive  value  of  L  for 
which  the  boundary  value  problem  has  a  nontrivial  solution.  The  solution 
is  that  L  can  be  calculated  from  the  equation 

\faL213  =  1.86635. 

Hint:  In  your  software  program  use  the  differential  equation  solver  to  find 
the  general  solution  as  a  linear  combination  of  the  two  Bessel  functions  Ji/ 3 
and  Yi  /3;  use  the  left  boundary  condition  in  limiting  form  to  determine  one 
constant  in  terms  of  the  other,  and  then  use  the  right  boundary  condition 
to  determine  the  appropriate  value  of  L. 


4.7  Sources  on  Bounded  Domains 

Readers  should  be  familiar  with  methods  for  solving  ordinary  differential  equa¬ 
tions  with  source,  or  forcing,  terms.  This  topic  is  facilitated  by  having  the  notion 
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of  a  general  solution  to  the  equation.  The  general  solution  to  the  second-order 
linear  equation 

Ly  =  y"  +  a(x)y'  +  b(x)y  =  f(x) 

is  y(x)  =  yh(x)  +  Vp(x)  where  y h  is  the  general  solution  to  the  homogeneous 
equation,  Lyh  =  0,  and  yp  is  any  particular  solution  to  the  nonhomogeneous 
equation,  Lyp  =  f(x).  For  simple  source  terms,  particular  solutions  are  usually 
found  by  the  method  of  undetermined  coefficients  (judicious  guessing)  or  by  a 
general  formula  called  the  variation  of  parameters  formula.  (See  Appendix  A). 

For  partial  differential  equations  the  situation  is  more  complicated.  We 
introduce  several  techniques  to  deal  with  both  nonhomogeneous  equations  and 
nonhomogeneous  boundary  conditions  on  bounded  spatial  domains  for  the  heat 
equation,  wave  equation,  and  Laplace’s  equation. 

One  obvious  approach  is  to  use  the  linearity  of  the  equation  and  boundary 
conditions  to  split  a  problem  into  simpler  forms,  essentially  breaking  apart  the 
inhomogenieties  in  the  problem.  Consider  the  initial  BVP  (P)  for  the  heat 
equation, 

{ut  ~  kuxx  =  f(x,  t),  a  <  x  <6,  t  >  0, 
u(a,  t)  =  6i(£),  u(b,  t)  =  62  W,  t  >  0, 
u(x,  0)  =  g(x),  t  >  0. 

Now  consider  the  two  problems 


{vt  ~  kvxx  =  0,  a  <  x  <b,  t  >  0, 
v(a,  t)  =  &i(£),  v(b,t)  =  b2(t),  t  >  0, 
v(x,  0)  =  g{x),  t  >  0, 

and 

{wt  ~  kwxx  =  /(x,  t),  a  <  x  <6,  t  >  0, 
w(a,t)  =  0,  w(b,t)  =  0,  t  >  0, 
w(x,  0)  =  0,  t  >  0. 

(Pi)  is  the  same  problem  as  (P)  with  no  source  term,  and  (P2)  is  the  same 
problem  as  (P)  with  homogeneous  boundary  conditions.  Using  linearity,  it  is 
easy  to  see  that  that  the  solution  u  to  (P)  is 


u(x,  t )  =  v(x,  t)  +  w(x,  £), 


where  v  and  w  are  the  solutions  to  (Pi)  and  (P2),  respectively.  Therefore, 
we  have  broken  down  (P)  into  two  simpler  problems.  The  initial  condition  is 
handled  by  Fourier  series. 

We  now  treat  several  cases.  First  we  use  Duhamel’s  principle  (see  Chapter  2 
for  the  unbounded  domain  case)  to  show  that  source  terms  can  put  into  initial 
conditions.  An  example  is  reviewed  for  the  heat  equation,  but  the  procedure 
works  equally  well  for  the  wave  equation  by  placing  the  source  in  the  intial 
condition  for  velocity. 
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Example  4.25 

(Duhamel’s  Principle)  Consider  a  simple  diffusion  model  given  by 

ut  ~  kuxx  =  f(x,  t),  0  <  x  <  7T,  t  >  0,  (4.82) 

'u(C),  t)  =  tx(7t,  t)  =  0,  t>  0,  (4.83) 

u(x,  0)  =  0,  0  <  x  <  7T.  (4.84) 

Interpreted  in  the  context  of  heat  flow,  u  =  ^(x,  £)  is  the  temperature  of  a 
rod  whose  initial  temperature  is  zero  and  whose  ends  are  maintained  at  zero 
degrees.  It  is  the  heat  source  f(x,t)  that  is  driving  the  system. 

The  simplest  way  to  solve  (4.82-4.84)  is  to  use  Duhamel’s  principle  as  formu¬ 
lated  in  Section  2.5  for  initial  value  problems  on  infinite  domains.  On  bounded 
domains  the  principle  is  the  same.  It  states  that  the  solution  of  (4.82-4.84)  is 
given  by 

u(x,t)  =  /  w(x,  t  —  r,  r)dr, 

Jo 

where  w  =  w(x,t,r)  is  the  solution  to  the  homogeneous  problem 


Wt  —  kwxx  =  0,  0  <  x  <  7r,  t  >  0, 
w{ 0,  t,  r)  =  re(7r,  t,  r)  =  0,  t  >  0, 
w(x,  0,  r)  =  /(x,  r),  0  <  x  <  tt. 


Recall  that  r  is  a  parameter.  We  have  already  solved  this  problem  in  Section  4.1. 
We  have 

oo 

w(x,t,r)  =  cne~n  kt  sinnx, 

n—  1 


where 


2  r 

cn  =  cn(r)  =  —  /  f(x,  t)  sinnx  dx. 

11  Jo 

Notice  that  the  Fourier  coefficients  cn  depend  on  the  parameter  r.  So,  the 
solution  to  (4.82-4.84)  is 


u(x,  t ) 


dr. 


(4.85) 


For  example,  solve  (4.82-4.84)  when  f(x,t)  =  sinx.  Easily  the  Fourier  coeffi¬ 
cients  are  given  by  cn  =  0,  n  >  1,  c\  =  1.  Then  the  solution  is 


f  c\e  k(yt  sin xdr 


u(x,  t ) 


JO 

r1 


sinx.  □ 
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Remark  4.26 

In  the  preceding  example,  the  limit  of  the  solution  as  t  oo  is 

lim  u(x,t)  =  (l/fc)sinx. 

t — yoo 

Notice  that  this  is  the  same  as  the  steady-state  solution,  i.e. ,  the  time- 
independent  solution  of  the  system 

Ut  —  kuxx  =  sinx,  0  <  x  <  7r,  t  >  0, 
u(0,t)  =  u(7r,t)  =  0,  t  >  0, 
u(x,  0)  =  0,  0  <  X  <  7T, 

found  by  solving  —U"{x)  =  sinx,  U( 0)  =  U(tt)  =  0. 

This  is  true  in  general.  If  the  source  term  in  (4.82-4.84)  in  the  heat  equation 
depends  only  on  x,  or  /(x,  t)  =  F(x),  then  as  t  oo  the  solution  to  (4.82-4.84) 
approaches  the  steady-state  solution  found  by  solving  —U"(x)  =  F(x),  U (0)  = 
U(n)  =  0.  □ 

Equations  with  sources  can  also  be  solved  by  an  eigenfunction  method. 
The  first  step  is  to  find  the  eigenfunctions  of  the  Sturm-Liouville  problem 
associated  with  the  homogeneous  problem,  with  no  sources. 


Example  4.27 

In  the  case  of  (4.82-4.84)  the  eigenvalues  and  eigenfunctions  of  the  homoge¬ 
neous  problems  are  (see  Section  4.1) 

A  n  =  n2,  yn(x)  =  sinnx,  n  =  l,2, .... 


Then  we  assume  a  solution  of  the  nonhomogeneous  problem  (4.82-4.84)  of  the 
form 

oo 

u(x,t)  =  gn(t)  sinnx,  (4.86) 

n= 1 

where  the  gn(t)  are  to  be  found.  We  determine  the  gn(t)  by  substituting  this 
expression  for  u  into  the  PDE  (4.82),  along  with  the  expression  for  the  eigen¬ 
function  expansion  for  /,  namely, 

oo 

f{x,t)  =  Yjn(t)  sin nx. 

n= 1 


In  this  last  expansion,  the  fn(t)  are  known  because  /  is  known;  the  fn(t)  are 
the  Fourier  coefficients 

2  r 

fn(t )  =  —  /  /(x,  t)  sinnxdx.  (4.87) 

71  J  o 
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Carrying  out  this  substitutions,  we  get 


777  E  9n(t )  sin  nx  -  k—^  E  5„(i)  sin  nx  =  E  fn(t)  sin nx, 


)2  00 


oo 


dt 


or 


n— 1 


oo 


&r2 


n=l 


n=l 


oo 


oo 


E  9'n  (* )  sin  nx  +  A:  E  n25n  (t )  sin  nx  =  E  /n(i)  sin  nx . 

n=l  n=l  n=l 

Collecting  the  coefficients  of  the  independent  eigenfunctions  sin  nx ,  we  obtain 

9n(t)  +  n2kgn(t)  =  fn(t), 

which  is  a  linear  ordinary  differential  equation  for  the  unknown  gn(t)-  It  can 
be  solved  in  the  standard  way  by  multiplying  by  the  integrating  factor  en  kt 
and  then  integrating  both  sides.  After  some  straightforward  manipulation  we 
obtain 

9n(t)  =  gn(0)e~n2kt  +  f  fn(r )e_"2fe^_’rW. 

Jo 

To  obtain  the  values  of  gn( 0)  we  substitute  the  initial  condition  into  (4.86)  to 
get 

OO 

gn  (0)  sin  nx  —  0, 

n— 1 

which  implies  gn{ 0)  =  0  for  all  n.  Therefore,  we  have  determined  the  gn(t),  and 
the  solution  to  the  problem  (4.82-4.84)  is 


u 


(x,t)  =  W  (  [  fn{r)e  ”2fc(*  r)dr)  sin  nx, 

n—1  'v'^°  ' 


(4.88) 


where  the  fn(t)  are  given  by  (4.87).  This  solution  formula  (4.88)  is  the  same 
as  the  solution  formula  (4.85)  obtained  by  Duhamel’s  principle.  □ 


In  summary,  in  the  eigenfunction  method  for  solving  problems  with  sources 
we  expand  both  the  source  /  and  the  assumed  solution  u  in  terms  of  the 
eigenfunctions  of  the  homogeneous  problem.  Substituting  these  expansions  into 
the  PDE  leads  to  ODEs  for  the  coefficients  in  the  eigenfunction  expansion 
for  u.  Solving  these  ODEs  for  the  coefficients  then  gives  the  solution  u  as 
an  eigenfunction  expansion.  Whereas  Duhamel’s  principle  is  applied  to  initial 
value  problems  (evolution  problems),  the  eigenfunction  expansion  method  can 
be  applied  to  all  types  problems,  including  equilibrium  problems  associated 
with  Laplace’s  equation. 

To  apply  the  separation  of  variables  method  we  need  homogeneous  bound¬ 
ary  conditions.  If  boundary  conditions  are  nonhomogeneous,  we  can  often  make 
a  change  of  the  dependent  variable  that  leads  to  homogeneous  ones. 
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Example  4.28 

(Homogenizing  the  boundary  conditions)  Consider  the  problem 

Ut  3 uxx  —  0?  0  x  <C  1,  t  0, 

1^(0,  t)  =  2e_t,  tz(1,£)  =  1, 
u(x,  0)  =  x2,  0  <  x  <  1. 

For  the  diffusion  equation  we  can  always  make  a  transformation  of  the  depen¬ 
dent  function  to  force  zero  boundary  conditions  at  the  expense  of  introducing 
a  source  term  in  the  PDE.  Define  w(x,t)  to  be  the  function  u(x,t)  minus  a 
linear  function  of  x  that  satisfies  the  boundary  conditions.  That  is,  take 

w(x,t)  =  u(x,t)  —  ^2e~f  +  ^1  —  2e~t^jxSj  . 

Then  w  solves  the  problem 

iv t  —  3 wxx  =  2e~t(l  —  x),  0  <  x  <  1,  t  >  0, 
w(0,t)  =  w(l,t)  =0,  t  >  0, 
w(x,  0)  =  x2  +  x,  0  <  x  <  1. 

This  problem  can  be  solved  using  the  eigenfunction  as  in  the  last  example.  □ 

Remark  4.29 

(Steady  heat  source)  For  the  heat  equation  with  a  steady  source, 

ut  ~  kuxx  =  /(x),  a  <  x  <6,  t  >  0, 
u(a,t)  =  A ,  u(b,t)  =  B,  t  >  0, 
u(x,  0)  =  g(x),  a  <  x  <  b, 

where  A  and  5  are  constant,  the  equilibrium  temperature  U  =  U(x)  satisfies 

-kU"  =  /(x),  U(a)  =  A ,  U(b)  =  B. 

The  substitution  w(x,t)  =  u(x,t)  ~  U(x)  leads  to  the  homogeneous  problem 

Wt  —  kwxx  =  0,  a  <  x  <  6,  t  >  0, 
w(a,t)  =  0,  w(b,  t)  =  0,  t  >  0, 
tz(x,  0)  =  G(x)  =  #(x)  —  ?7(x),  a  <  x  <b. 

This  problem  may  be  solved  using  separation  of  variables.  □ 
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Remark  4.30 

It  can  be  generally  stated  that  if  the  heat  equation  has  boundary  conditions  of 
the  form 

aiu(Q,t)  +  a2ux(0,t)  =  Pi u(l,t)  +  p2Ux(l,t)  =  g2(t), 

then  the  transformation 

l~i)  + 

where  a(t)  and  b(t)  are  chosen  so  that  U (x,  t )  satisfies  the  boundary  conditions, 
leads  to  a  problem  for  w  with  homogeneous  boundary  conditions.  The  source 
term  and  initial  condition  may  be  altered.  □ 

Example  4.31 

(Wave  equation)  Consider  the  nonhomogeneous  wave  equation 

'U'tt  —  ^ xx  T  Ax,  0  x  <C  1,  t  0, 
i^(0,  t)  =  0,  u{\,  t)  =  0,  t  >  0, 
u(x,  0)  =  0,  ut(x,  0)  =  0,  0  <  x  <  1. 

Here  there  is  an  equilibrium  solution  U  =  U(x)  satisfying 

-U"  =  Ax,  U  (0)  =  U(  1)  =  0. 

It  is  easily  found  by  integrating  twice  that  U (x)  =  ^x(^l  —  x2^j  .  Then,  w(x,  t)  = 
u(x,  t)  —  U (x)  satisfies 

Wtt  =  Wxx,  0<x<l,  t  >  0, 
w(0,  t)  =  0,  w(l,  t)  =  0,  t  >  0, 
w(x,  0)  =  —  U(x),  wt(x,  0)  =0,  0  <  x  <  1. 

Now  the  problem  can  be  solved  using  separation  of  variables.  □ 

Example  4.32 

Consider  the  wave  equation  with  nonhomogeneous  boundary  conditions: 

'U'tt  —  ^xxi  0  <C  X  <C  /,  t  0, 

u(0,t)  =  a(t),  u(l,t)  =  b(t),  t  >  0, 
u(x,  0)  =  0,  ut(x,  0)  =  0,  0  <  x  <  l. 


u 


w  +  U(x,t)  =  w  +  a(t)  ^ 
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The  reader  can  easily  verify  that  the  change  of  variables 

M*,t> = -  (m + m -a{t>x 


leads  to  a  problem  with  homogeneous  boundary  conditions  for  w.  Nonhomoge- 
neous  terms  are  introduced  in  the  resulting  PDE  and  initial  conditions.  □ 


EXERCISES 

1.  Use  Duhamehs  principle  as  formulated  for  the  wave  equation  in  Section  2.5 
to  find  the  solution  to 

Utt  -  C2UXX  =  f(x,  t),  0  <  X  <  IT,  t  >  0, 

u( 0,  t)  =  u(tt,  t)  =  0,  t  >  0, 
u(x,  0)  =  Ut(x,  0)  =  0,  0  <  X  <  7T. 

Use  the  eigenfunction  method  to  solve  the  problem  in  Exercise  1.  Hint:  In 
the  eigenfunction  method  you  will  have  to  solve  a  second-order  nonhomo- 
geneous  ODE,  which  can  be  done  using  variation  of  parameters. 

In  the  problem  defined  by  (4.82-4.84)  assume  that  the  heat  source  does 
not  depend  on  time,  i.e.,  f(x,t)  =  F(x).  By  calculating  the  r-integral, 
show  how  the  solution  (4.85)  simplifies  in  this  case.  Find  a  formula  for  the 
solution  when  the  heat  source  is  F(x)  =  x(tt  —  x)  and  then  find  the  limiting 
temperature  profile  as  t  oo.  Show  that  this  limiting  temperature  profile 
is  the  same  as  the  steady-state  solution,  that  is,  the  solution  v  =  v(x)  to 

—kv"(x)  =  x(tt  —  x),  0  <  x  <  7r;  v(0)  =  v(? r)  =  0. 

4.  Solve 

Uf 

u(  0,  t) 
u(x ,  0) 

5.  Solve 

Ut  —  'LL xx  F  Q 5  0  X  <C  1,  t  0, 

i^(0,  t)  =  0,  u(l,  t)  =  2uo,  t  >  0, 
u(x,  0)  =  Uq(1  —  COS7rx),  0  <  X  <  1. 


—  LLxxi  0  X  1,  t  0, 
=  0,  u(  1,  t)  =  A,  t  >  0, 
=  cosx,  0  <  x  <  1. 


2. 


3. 


4.7  Sources  on  Bounded  Domains 


215 


6.  Transform  the  problem 

Uf  —  kuXXl  0  <C  x  <C  1,  7  0, 

7/(0, 7)  =  cos7,  hu(l,  7)  +  ux(l,  7)  =  1,  7>  0, 
u(x,  0)  =  sin7rx  +  x,  0  <  x  <  1, 

into  a  problem  with  homogeneous  boundary  conditions.  Give  a  physical 
interpretation  of  the  boundary  conditions. 

7.  Solve  the  problem 

Ut  =  kuxx  +  sin  37 nr,  0  <  x  <  1,  7  >  0, 

7/(0,  7)  =  7/(1,  7)  =  0,  7  >  0, 

7/(x,0)  =  sin7rx,  0  <  x  <  1. 

8.  Solve  the  wave  equation  with  a  constant  gravitational  force: 

Utt  =  c2uxx  -  g,  0  <  X  <  1,  t  >  0, 

7/(0,  7)  =  7/(1,  7)  =0,  7  >  0, 
t/(x,  0)  =  ut(x,  0)  =  0,  0  <  x  <  1. 


9.  Solve 

utt  =  uxx  +  q ,  0<x  <  1,  7  >  0, 

7/(0,  7)  =  0,  7/(1,  7)  =  sin  7,  7  >  0, 

7/(x,  0)  =  x(l  —  x),  7/t(x,  0)  =  0,  0  <  x  <  1. 

10.  Use  Duhamehs  principle  to  find  a  bounded  solution  to 

ut  =  At/  +  /(r,  7),  0  <  r  <  77,  7  >  0, 
t/(77,  7)  =  0,  7  >  0, 

T/(r,  0)  =  0,  0  <  r  <  77. 

11.  Find  the  solution  of  the  diffusion  problem  on  a  disk: 

1 

ut  =  -  ( rur)r  +  /(r),  r  <  77,  7  >  0, 

t/(77,  7)  =  7/o,  7  >  0, 

7/(r,  0)  =  g(r),  r  <  77. 
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4.8  Poisson’s  Equation* 

One  of  the  fundamental  equations  in  electrostatics  and  in  steady  heat  flow  is 
Poisson’s  equation  with  a  Dirichlet  boundary  condition: 

A u  =  uxx  +  uyy  =  /(x,  y),  (x, y)  G  17, 

u(x,y)  =  0,  (x,  ?/)  G  917. 

A  basic  result  is  that  the  nonhomogeneous  source  term  f(x,y)  can  be  put 
into  the  boundary  condition  by  changing  the  dependent  variable;  conversely, 
a  nonhomogeneous  boundary  condition  can  be  put  into  a  source  term  in  the 
PDE. 

1.  If  A u  =  f  on  17  and  u  =  g  on  917,  let  u(x,  y)  =  w(x,  y)  +  u(x,  y)  where  v  is 
any  particular  solution  of  Av  =  f.  Then  A u  =  Aw  +  Av  =  f.  So,  Aw  =  0. 
On  the  boundary  917,  u  =  w  +  v  =  g,  where  v  is  the  particular  solution 
above.  So  w  =  g  —  v  on  9 17.  In  summary, 

Aw  =  0  in  17;  w  —  g  —  v  on  917. 

2.  Let  A u  =  /  in  17  and  u  =  g  on  917.  Let  u  be  any  function  that  satisfies 
the  the  boundary  condition.  Let  u(x,y)  =  w(x,y)  +  v(x,y)  and  A u  = 
Aw  +  Av  =  /,  or  Aw  =  f  —  Av.  On  the  boundary  917,  we  have  u  =  w  +  v 
or  g  =  re  +  g.  Thus 


Are  =  /  —  Au  in  17;  w  =  0  on  917. 

In  summary,  this  result  states  that  the  Dirichlet  problem  for  Poisson’s  equa¬ 
tion  can  be  solved  if  the  Dirichlet  problem  for  Laplace’s  equation  can  be 
solved. 

Example  4.33 

For  simple  regions  17,  finding  a  particular  solution  is  easy  if  f(x,y)  is  a  poly¬ 
nomial  of  degree  n.  Just  assume  a  particular  solution  in  the  form  of  an  n  +  2 
degree  polynomial.  For  example,  consider  Poisson’s  equation  A u  =  —2  on  a 
rectangle  0  <  x  <  a,  0  <  y  <  b,  with  zero  Dirichlet  boundary  conditions.  Take 

u(x,  y)  =  A  +  Bx  +  Cy  +  Dx2  +  Exy  +  Fy2 , 

a  second  degree  polynomial.  Substituting  into  the  PDE  gives  2D  +  2F  =  —2  or 
D  +  F  =  —  1.  So  choose  F  =  0  and  D  =  —  1.  The  other  constants  are  arbitrary. 
Choose  B  =  a.  Then,  v(x,y)  =  ax  —  x2  is  a  particular  solution.  Therefore 
Are  =  0  in  17  and  re  =  —  u(x,  y)  on  917;  in  particular,  w(0,  y)  =  re(a,  y)  =  0  and 
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w(x,  0)  =  w(x,  b)  =  x2  —  ax.  This  problem  for  w  can  be  solved  by  separation 
of  variables  because  it  has  homogeneous  boundary  conditions  at  x  =  0  and 
x  =  a.  □ 


Example  4.34 

On  a  disk  of  radius  77,  solve  the  boundary  value  problem 


A u  =  2,  0  <  r  <  77, 

1^(77,  0)  =  0,  0  <  6  <  27 r. 


In  polar  coordinates,  the  PDE  is 


1 

r 


(rur )  +  =  2 


Because  the  boundary  condition  and  source  are  independent  of  0,  this  problem 
has  a  solution  that  depends  only  on  r.  By  direct  integration  of  the  PDE, 

l 

u(r ,  6)  =  -r2  +  ci  In  r  +  c2, 


where  ci  and  c2  are  arbitrary.  Clearly  ci  =  0  by  boundedness.  Then  u  (R,  9)  =  0 
forces  c2  =  — 772 / 2,  giving 

-i 

u(r, 9)  =  -  [r2  -R2).  □ 


Eigenvalue  Problems 

A  problem  of  great  importance  is  to  determine  the  eigenvalues  of  spatial 
differential  operators  on  bounded  domain. 

Example  4.35 

(Eigenvalues  of  the  Laplacian)  Consider  the  two-dimensional  problem 

—A u  =  A u,  x  G  72,  (4.89) 

u  =  0,  x  E  df2.  (4.90) 

Generally,  !?  is  a  two  dimensional  domain  with  boundaries  along  the  coordi¬ 
nate  direct  ions.  To  find  the  eigenvalues  A  we  use  separation  of  variables.  The 
negative  sign  on  the  Laplacian  gives  positive  eigenvalues.  Specifically,  we  take 
!?:0<£<7t,  0  <  y  <  tt.  Then  the  PDE  is  —  (uxx  +  uyy)  =  Xu  for  u  =  u(x,y). 
The  Dirichlet  boundary  condition  becomes  u( 0,  y)  =  ^(tt,  y)  =  0  for  0  <  y  <  tt, 
and  u(x,  0)  =  u(x,  tt)  =  0  for  0  <  x  <  tt.  We  assume  a  separable  solution  of 
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the  form  u(x,y)  =  X(x)Y(y )  for  some  functions  X  and  Y  to  be  determined. 
Substituting  gives 

-X"(x)Y(y)  -  X{x)Y"{y)  =  A X(x)Y(y), 
which  can  be  rewritten  as 

X"(x)  Y"(y)  +  A  Y(y) 

X(x)  Y(y) 

with  a  function  of  x  on  one  side  and  a  function  of  y  on  the  other.  Therefore, 

X"(x)  Y"(y)+\Y(y) 

X(x)  Y{y)  M 

for  some  separation  constant  fi.  Hence,  the  PDE  separates  into  two  ODEs  for 
the  two  spatial  variables: 

-X"(x)  =  iiX(x),  —Y"(y)  =  (A  -  y)Y (y). 

Next  we  substitute  the  assumed  form  of  u  into  the  boundary  conditions  to 
obtain 

X(0)  =  X(n)  =  0  and  T(0)  =  Y(n)  =  0. 

Consequently  we  have  obtained  two  boundary  value  problems  of  Sturm- 
Liouville  type, 

-X"(x)  =  yX(x ),  X(0)  =  X(tt)  =  0, 

-Y"(y)  =  (A  -  y)Y(y),  Y( 0)  =  Y(tt)  =  0. 

The  first  problem  has  been  solved  earlier  in  this  chapter  and  we  have  eigenpairs 

Xn{x)  =  sinnx,  yn  =  n2,  n  =  1,2,3,.... 

Then  the  Y  problem  becomes 

-Y"(y)=  (A-n2)F(y),  F(0)  =  F(f)  =  0. 

This  problem  is  also  of  Sturm-Liouville  type  and  will  have  nontrivial  solutions 
Yk(y)  =  sin  ky  when  A  — n2  =  /c2,  for  k  =  1,  2,  3, ...  Therefore,  double  indexing  A, 
we  get  eigenvalues  and  eigenfunctions  for  the  negative  Laplacian  with  Dirichlet 
boundary  conditions  as 

un,k{%,  V)  =  sin  nx  sin  ky,  \n,k  =  n  +fc,  n,  k  =  1,  2,  3, ... 

Observe  that  there  are  infinitely  many  positive,  real  eigenvalues,  whose  limit  is 
infinity,  and  the  corresponding  eigenfunctions  are  orthogonal,  that  is, 

/  (smnx  sin  ky)(smmx  sin  ly)dxdy  =  0  if  n  ^  m  or  k  ^  l.  □ 

J  Q 
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This  example  illustrates  what  to  expect  from  the  Dirichlet  problem  (4.89- 
4.90).  In  fact,  the  following  properties  hold,  where  we  use  the  vector  notation 
x  =  (x,y). 

1.  The  eigenvalues  are  real. 

2.  There  are  infinitely  many  eigenvalues  that  can  be  ordered  as  0  <  Ai  < 
A2  <  A3  <  •  •  •  with  An  — >  +00  as  n  — >  00. 


3.  Eigenfunctions  corresponding  to  distinct  eigenvalues  are  orthogonal  with 
inner  product  (u,v)  =  fn  u(x)v(x)dx. 

4.  The  eigenfunctions  un(x)  form  a  complete  orthogonal  set  in  the  sense  that 
any  square-integrable  function  /(x)  on  17  can  be  uniquely  represented  in 
its  generalized  Fourier  series 


00 


/(x)  =  y>wn»(x),  Cn  = 


/,  un 


n=  1 


U 


n 


n 


where  the  cn  are  the  Fourier  coefficients,  and  the  norm  is  u 
Convergence  of  the  series  is  in  the  L2(l 7)  sense,  meaning 

2 

cnun{x)  I  dx  0  as  N  00 


\J m  ^n) 


/  (/M  - 1 

V  n=l 


The  results  are  exactly  the  same  when  the  Dirichlet  boundary  condition 
(4.90)  is  replaced  by  a  Neumann  boundary  condition  ^  =  0,  with  the  exception 
that  A  =  0  is  also  an  eigenvalue,  with  eigenfunction  uo(x)  =  constant.  When 
a  Robin  boundary  condition  ^  +  a(x)u  =  0  is  imposed,  again  there  is  a  zero 
eigenvalue  with  constant  eigenfunction  provided  a(x)  >  0.  If  a(x)  fails  to  be 
nonnegative,  then  there  may  also  be  negative  eigenvalues. 


Remark  4.36 

The  previous  results  extend  to  the  higher  dimension  Sturm-Liouville  problem 

— div  (p  grad  u)  +  qu  =  Aum,  x  G  ft. 

Here,  w  =  w{x)  >  0,  p  =  p(x)  >  0,  q  =  <7(x),  p,  q  and  w  are  continuous  on  17, 
and  p  has  continuous  first  partial  derivatives  on  ft.  The  boundary  conditions 
may  be  Dirichlet  (u  =  0  on  d ft)  Neumann  (^  =  0  on  d  17),  or  Robin  + 
a(x)u  =  0  on  917,  with  a(x)  >  0).  In  these  cases  the  eigenfunctions  u  and  v 
are  orthogonal  with  respect  to  the  inner  product 

'  u(x)v(x)w(x)  dx, 

Q 


(' U,V)W 
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and  \\u\\w  =  \J Ju,  u)w.  The  Fourier  series  takes  the  form 


oo 


/(x)  =  Y^CnUn(X),  Cn 


f,  Un 


n 


U 


n 


w 

2  ’ 
w 


where  convergence  is 


V  n=l 


cnun(x)  w(x)  dx  0  as  N  oo.  □ 


Just  as  for  one-dimensional  Sturm-Liouville  problems,  the  completeness  of 
the  set  of  eigenfunctions  allows  us  to  solve  the  nonhomogeneous  problem. 


Example  4.37 

(Poisson’s  equation)  Consider  the  problem 

—A u  =  p(x),  x  G  17,  (4.91) 

u(x)  =  0,  x  G  917.  (4.92) 

This  problem  can  be  interpreted  in  either  one,  two,  or  three  dimensions,  and  the 
weight  function  is  1.  Let  An,  un(x),  n  =  1,2,3. ..  be  eigenvalue-eigenfunction 
pairs  for  the  negative  Laplacian,  i.e., 


—A u  =  Xu,  x  G  17, 
u(x)  =  0,  x  G  917. 

Next,  assume  a  solution  to  (4.91-4.92)  of  the  form1 


u 


(x)  = 


where  the  cn  are  to  be  determined.  Further,  expand  p  in  terms  of  the  eigen¬ 
functions  as 

\Piun 

p(x)  =  y  y/9„MTO(x),  Pn  = 


u 


Substituting  into  the  PDE  (4.91)  gives 


A U  —  ^  ^  Cn^Un  —  ^  ^  CnXnUn  —  ^  ^  Pn^n  ■ 


Therefore 


CnXn  —  Pm  Tl  —  1,2,3, 


All  sums  are  over  n  —  1,  2,  3, ... . 


9 
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If  An  7^  0  for  all  n,  then 


c 


P 


n 


n 


A 


n 


for  all  n.  Therefore,  we  have  the  solution  representation 


u(yp) 


Y  y-un(x) 

A  T) 


It  is  instructive  to  write  this  as 


u(yp)  = 


E 


E 

E 


(Pj  un) 

A n  1 1  1 1  ^ 

1 


^n(x) 


A 


n 


n 


E 


^n||  JQ 

un(y)un(x.) 
An  1 1  Wn  1 I  ^ 


p(y)un(y)dyun{x.) 


p(y)dy. 


If  we  define  the  Green’s  function  by 


G(x,y)  =  Y 


un(y)un(x.) 


A  n  1 1  U 


n  ^ n 


Then  the  solution  may  be  written  simply  as 


[  G{x,y)p(y)dy. 
J  n 


Thus,  the  solution  is  a  weighted  average  of  contributions  at  every  point  of  the 
domain.  A  different  interpretation  is  that  the  right  side,  an  integral  operator 
with  kernel  G,  is  the  inverse  operator  of  —A  with  a  Dirichlet  boundary  condi¬ 
tion.  In  other  words  the  solution  to  —A u  =  p  is  u  =  (— A)_1p.  □ 


Example  4.38 

(Neumann  problem)  If  we  consider  the  Neumann  problem 

—A u  =  p(x),  x  G  17, 

du  .  .  ^  _  _ 

—  (x)  =0,  x  G  917, 
an 

then  Ai  =  0  is  an  eigenvalue  of  the  negative  Laplacian  with  nontrivial  eigen¬ 
function  ui(x)  =  1.  Then  the  same  calculation  as  above  repeats  and  we  obtain 
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Now  An  7^  0  for  all  n  >  1,  and  we  get 


c 


n 


Pn 


n  >  1. 


However,  in  the  case  Ai  =  0,  we  get  c\  •  0  =  p\.  If  pi  ^  0,  then  this  problem 
has  no  solution.  However,  if  pi  =  0,  that  is,  (p,  u\)  =  0,  then  c\  is  arbitrary. 
Therefore, 


oo 


U 


n—2 


(x)  =  Cl  +  Y-un{x) 


=  Cl  + 


Q 


oo 

E 

vn= 2 


un(y)un(x) 


^n.  1 1  U 


n  ^ n 


p(y)dy ,  ci  arbitrary. 


In  summary,  if  p  is  orthogonal  to  the  eigenfunction  r^i(x)  =  1,  there  are 
infinitely  many  solutions.  If  p  is  not  orthogonal  to  'Ui(x)  =  1,  then  there  is 
no  solution.  □ 


This  latter  result  can  be  generalized  as  follows. 


Theorem  4.39 

(Fredholm  alternative)  Consider  the  boundary  value  problem 

— div  (p(x)  gradr^)  +  q(x.)u  =  pu  +  /(x),  x  G  17, 

with  homogeneous  Dirichlet,  Neumann,  or  Robin  boundary  conditions,  where 
the  coefficient  functions  satisfy  the  conditions  in  Remark  4.32,  and  /  is  a  given 
function.  Then, 

(a)  If  p  is  not  an  eigenvalue  of  the  corresponding  homogeneous  problem  (/  = 
0),  then  there  is  a  unique  solution  for  all  functions  /  with  ff?  /(x)dx  <  oo. 

(b)  If  p  is  an  eigenvalue  of  the  homogenous  problem,  then  there  is  no  solution 
or  infinitely  many  solutions,  depending  upon  the  function  /. 

The  proof  of  this  theorem  is  straightforward  and  follows  exactly  the  for¬ 
mat  of  the  proofs  in  Section  4.2,  namely,  to  expand  u  and  /  in  eigenfunction 
expansions  and  solve  for  the  coefficients  of  u. 

Green’s  Function  for  Infinite  Domains 

Solving  Poisson’s  equation  on  bounded  domains  can,  in  theory,  be  done  by 
the  eigenfunction  method  discussed  above.  In  practice,  the  domain  must  be 
simple  for  success.  For  infinite  domains  we  illustrate  a  physical  approach  based 
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on  point  sources.  Recall  that  (Section  2.1),  for  the  heat  equation,  we  found  the 
solution  to  a  problem  with  a  unit,  point  source  and  then  superimposed  those 
solutions  over  a  distribution  of  sources  to  obtain  the  solution  to  the  Cauchy 
problem.  For  Poisson’s  equation  we  appeal  to  electrostatics  in  two  dimensions. 
(Equally,  one  could  think  in  terms  of  unit  heat  sources.)  A  unit  positive  charge 
placed  at  the  origin  induces  an  electric  field  whose  potential  is  given  by 

1 


U(x,y)  = - In  r,  r  =  \/  x2  +  y2 

2tt 

which  satisfies  Laplace’s  equation.  The  potential  is  not  defined  at  (0,0).  The 
equipotential  curves  are  clearly  circles  centered  at  the  origin,  and  consistent 
with  the  fact  that  for  a  unit  charge,  the  flux  through  an  arbitrary  circle  Cr  of 
radius  r  is 


•27 r 


flux  = 


grad  u  ■  nds  =  i 
Cr  JO 


urr  drdO  =  1 


By  simple  translation,  the  potential  response  at  (x,  y)  caused  by  a  unit  point 
charge  at  any  point  (£,  rj)  is 


1 


G(x,  y ,  £,  rj)  =  -  —  In  \/(x-  £)2  +  (y  -  rj)2, 


(4.93) 


which  is  Green’s  function  in  M2.  Arguing  exactly  as  for  the  heat  equation,  we 
can  solve  Poisson’s  equation 


Uxx  +  Uyy  =  f(x,y),  (x,y)eM: 


(4.94) 


by  treating  the  source  term  /  as  a  distributed  set  of  point  sources  at  (£,  rj)  of 
magnitude  /(£,  rj)  for  all  £  and  rj.  By  linearity  we  can  superimpose  these  point 
source  solutions  /(£,  rj)G(x,  y,  £,  77)  over  all  space  to  obtain 


u(x>y)=  G(x,  y,  £,  ??)/(£,  f?)  d^dy, 

it2 


which  is  the  solution  to  (4.94).  Here,  we  assume  that  the  source  vanishes  suffi¬ 
ciently  rapid  at  infinity. 


Example  4.40 

(Green’s  function  in  upper-half  plane)  For  infinite  domains  having  bound¬ 
aries  with  simple  geometry,  we  can  modify  the  Green’s  function  above  to  derive 
a  solution.  Consider  Poisson’s  equation  in  the  upper-half  plane: 


Uxx  +  Uyy  =  /( X,  y),  x  G  R,  y  >  0, 
u(x,  0)  =  0,  x  €  M. 


(4.95) 

(4.96) 
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For  a  point  charge,  G(x,  y ,  £,  77)  the  Green’s  function  (4.93)  does  not  satisfy 
the  boundary  condition  at  y  =  0.  So  we  introduce  a  negative  point  charge 
at  the  point  (£,  —  rj)  to  counter  the  positive  point  charge  at  (£,77).  The  Green’s 
function  for  this  image  charge  is  G(x,  y ,  £,  —77).  See  Figure  4.5.  Therefore  the  net 
potential  at  an  arbitrary  point  (x,  y)  in  y  >  0  is  the  sum  of  the  two  potentials 
from  both  charges,  or 

1  1  \  ~R 

G(x.  y ,  G  n)  = - In  R  H - In  R  =  —  In  — , 

v  2tt  2tt  2tt  R' 

where  R  and  R  are  the  distances  shown  in  the  figure,  which  are  easily  calculated 
by  the  distance  formula.  Easily  G  satisfies  the  zero  boundary  condition  at 
y  =  0.  Therefore,  G  is  Green’s  function  for  the  upper-half  plane.  Again,  by 
superposition,  the  solution  to  (4.95-4.96)  is 


u(x,y) 


/(£>  v)  d£dr). 
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Figure  4.5  A  unit  (+1)  charge  placed  at  (£,  77)  and  its  negative  image  charge 
(— 1)  at  (£,  —77)..  The  distances  between  those  charges  and  an  arbitrary  point 
(x,  y)  are  R  and  R ,  respectively,  which  are  functions  of  x,  y ,  ,  77 


The  procedure  in  this  example  is  called  the  method  of  images,  and  it 
can  be  applied  for  simple  domains  in  both  2  and  3  dimensions.  See  Strauss 
(1992)  or  Haberman  (2013),  for  example,  for  a  thorough  discussion  of  Green’s 
functions. 


EXERCISES 

1.  Solve  the  Dirichlet  problem  on  a  disk  of  radius  r  =  2: 

Uxx  +  Uyy  =  0,  x2  +  y2  <  2.  u(x,  y)  =  sin  9,  x2  +  y2  =  2. 
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2.  On  a  disk  of  radius  R  solve 

A u  =  —A  in  0  <  r  <  R,  u  =  1  on  r  =  R. 

3.  Solve  the  Poisson  equation  A u  =  1  on  the  unit  disk  with  boundary  condi¬ 
tion  u(l,Q)  =  sin  0. 

4.  Solve  A u  =  1  on  the  annulus  Rq  <  r  <  R\  with  u  =  0  at  r  =  Ro,  r  =  R\. 

5.  On  the  disk  of  radius  R  in  the  plane  solve 

A u  =  /(r),  r  <  1,  u  —  uq,  r  =  R, 

where  uq  is  a  constant. 

6.  Solve  the  boundary  value  problem 

A  u  =  /(r),  0  <  r  <  A*,  0  <  0  <  27r, 
u(R,  0)  =  g(6 ),  0  <  6  <  2n. 

7.  In  M2  find  the  eigenvalues  and  eigenfunctions  of  the  Neumann  problem 

A  A  „  (llL 

—A u  =  Xu,  x  G  S2,  —  =  0,  x  G  oil, 

an 

where  Q  is  the  rectangle  0  <  £  <  7r,  0  <  y  <  1. 

8.  Use  the  eigenfunction  method  to  solve  the  Dirichlet  problem  for  the  Poisson 
equation: 

uxx  +  =  / (x, 2/),  0  <  x  <  7T,  0  <  y  <  1, 

^(0,2/)  =  u(7r,y)  =  0,  0<2/<l, 

0)  =  1)  =  0,  0  <  X  <  7T. 

Hint:  Assume  u(x,t)  =  9n(y)  sinnx.  Substitute  into  the  PDE  and 

boundary  conditions  at  y  =  0  and  y  =  1  to  obtain 

y"(y)  -  n2gn{y )  =  /„(y),  pn(0)  =  5„(1)  =  0, 

where  fn(y )  =  Lj  fj  f(x,  y)  sin  nx  dx.  Write  the  general  solution  in  the 
form 

9n(y)  =  cieny  +  c2e~ny  +  -  f  /„(£)  sinh n(y  - 

n  Jo 

and  determine  the  constants  c\  and  C2. 

9.  Consider  Laplace’s  equation  on  a  unit  sphere. 
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a)  Assume  a  solution  of  the  form  u(r,  0 ,  </>)  =  Sn((j))rn ,  n  —  0, 1,  2, . . and 
show  that  5n  (</>),  the  spherical  harmonics ,  satisfies  the  equation 

5"  +  +  n(n  +  l)Sn  =  0. 

sm</> 


b)  Change  the  independent  variable  to  x  =  cos  with  Pn(x)  =  5n  (</>), 
and  show 

^1  —  x2^j  P"  —  2  xP'n  +  n{n  +  1  )Pn  =  0. 

This  is  Legendre’s  differential  equation. 

c)  Assume  a  power  series  solution  of  Legendre’s  equation  of  the  form 


oo 

Pn(x)  = 

k=0 


and  show  that  for  each  fixed  n, 

k(k  +  1)  —  n(n  +  1) 


&k+ 2 


{k  +  2)(fc  +  1) 


afc,  k  =  0, 1,  2, . . . . 


d)  Show  that  for  each  fixed  n,  there  is  a  polynomial  solution.  These 
are  called  the  Legendre  polynomials.  Up  to  a  constant  multiple,  find 
the  first  four  Legendre  polynomials  Po  (x),...,P3  (x).  [Note:  It  can  be 
shown  that  the  Legendre  polynomials  are  orthogonal  on  —  1  <  x  <  1. 

10.  Consider  the  partial  differential  operator 

Lu  =  — div  (p  gradr^)  +  qu ,  x  E  i?, 

where  p  =  p(x)  >  o,  q  =  g(x),  p  and  q  are  continuous  on  i?,  and  p  has 
continuous  first  partial  derivatives  on  i?. 


a)  Prove  the  integration  by  parts  formula 


vLudx=  I  uLv dx  +  /  p  (  u^-  —  v^-  I  dA. 

f2  J  f2  J  df2 


dn  dn 


b)  Consider  the  eigenvalue  problem  Lu  =  A u,  x  G  17,  with  a  Dirichlet 
boundary  condition  u  =  0,  x  E  dft.  Prove  that  the  eigenvalues  are 
positive  and  that  distinct  eigenvalues  have  corresponding  orthogonal 
eigenfunctions. 

11.  Use  the  method  of  images  to  find  the  Green’s  function  in  the  first  quadrant 
of  the  plane,  x  >  0,  y  >  0,  with  zero  conditions  on  the  boundaries  x  = 
y  =  0.  Hint:  Place  appropriate  image  charges  in  the  second  and  fourth 
quadrants. 
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12.  Find  Green’s  function  G(r,  0,  p,  0)  for  the  unit  circle  with  zero  boundary 
condition.  Hint:  Find  the  potential  at  (r,  6)  due  to  unit  source  charge  (p,  0) 
inside  the  circle.  Place  an  image  charge  outside  the  circle  at  (1/p,  0). 

13.  Show  that  Green’s  function  for  M3  is  given  by  G(x,  y)  =  1/4ttR,  which  is 
the  potential  at  x  caused  by  a  point  source  at  y,  and  R  =|  x  —  y  |.  Hint: 
Find  the  radial  solution  to  Laplace’s  equation  that  has  unit  flux  across  any 
sphere  containing  the  origin. 


5 

Applications  in  the  Life  Sciences 


In  Chapter  1  we  introduced  simple  advection  and  diffusion  models  to  describe 
the  motion  of  organisms,  cells,  and  chemicals  in  a  biological  science  context.  In 
this  chapter  we  extend  these  ideas  to  more  complicated  phenomena  involving 
age  structure  of  a  population,  the  propagation  of  epidemic  waves,  and  the 
relationship  between  spatial  pattern  formation  and  chemical  instability.  These 
advanced  models  show  why  PDEs  have  vast  applications  in  the  life  sciences. 
The  mathematical  methods  we  introduce  to  analyze  these  problems  extend  the 
ideas  and  techniques  presented  in  the  earlier  chapters.  For  collateral  reading, 
extensive  applications  of  PDEs  in  biology  can  be  found  in  Edelstein-Keshet 
(2005),  Kot  (2001),  Britton  (2003),  and  Murray  (2003). 


5.1  Age-Structured  Models 

By  an  age-structured  model  we  mean  a  demographic  model  where  the  pop¬ 
ulation  at  time  t  has  an  age  distribution  superimposed  on  it.  Thus,  at  time 
t  we  also  consider  ages  of  the  individuals.  Age- structured  models  are  akin  to 
more  general  physiologically-structured  models  where  any  other  variable, 
say  size  or  weight,  could  replace  age.  For  example,  in  many  non-mammalian 
populations  the  evolution  of  the  population,  especially  the  mortality  rate,  cer¬ 
tainly  depends  upon  the  size  of  the  animals;  the  survival  probability  is  small 
for  young  fish  or  insects.  Our  modeling  focuses  on  age  structure,  but  similar 
ideas  apply  to  any  physiological  structure. 
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We  begin  by  considering  a  population  of  female  organisms  whose  age- 
structure  at  time  t  —  0  is  given  by  /(a).  That  is,  f(a)da  is  approximately 
the  number  of  females  between  age  a  and  a  +  da.  Generally,  we  use  females 
in  demographic  models  because  they  have  a  well-defined  beginning  and  end  to 
their  reproductive  capacity.  For  simplicity,  we  write  the  age  range  as  0  <  a  <  oo 
even  though  the  age  of  death  is  finite.  Given  the  initial  age  structure,  the  mor¬ 
tality  rate,  and  the  fecundity  rate  (average  number  of  offspring  produced  per 
female),  the  problem  is  to  determine  the  age  structure  u  =  u(a,t)  at  any  time 
t  >  0.  Here,  u(a,  t)da  represents  the  number  of  females  at  time  t  between  the 
ages  a  and  a  +  da.  Then  the  total  female  population  at  time  t  is 

nOO 

N(t )  =  /  u(a,  t)da. 

Jo 

What  makes  this  problem  more  intractable  than  other  problems  is  that 
u(0,t),  which  is  the  number  of  newborns  at  time  t,  is  not  known  a  priori. 
Rather,  u( 0,  t)  depends  upon  the  population  and  age  of  reproducing  females, 
which  is  the  unknown  in  the  problem.  Specifically,  we  assume  that  the  per  capita 
mortality  rate  is  given  as  a  function  of  age  as  m  =  m(a).  And  we  assume  that 
the  fecundity  rate  is  b(a,  £),  which  depends  upon  both  time  and  age  of  the 
female.  Precisely,  b(a,t)  is  the  average  number  of  offspring  per  female  of  age 
a  at  time  £;  the  function  b(a,  t )  is  called  the  maternity  function.  We  expect 
b(a,  t)  to  be  zero  until  the  age  of  onset  of  maturity  and  zero  after  menopause. 
In  between  these  ages  the  fecundity  rate  varies  according  to  age;  for  example, 
in  humans,  women  of  age  25  are  more  fertile  than  women  of  age  40.  Figure  5.1 
depicts  the  evolution  of  age  structure  profiles  in  a  three-dimensional  atu- space 
(age-time-density  space). 

The  governing  dynamics  is  given  by  the  equation 

Ut  =  —ua  —  m(a)u. 

Thus,  the  model  equation  is  the  advection  equation  with  speed  one  and  sink 
term  given  by  the  mortality  rate;  notice  that  the  flux  is  =  u,  or  the  number 
crossing  the  age  a  line  at  time  t  is  just  u(a,t),  the  density  at  that  age.  This 
continuous  time  model  has  its  origins  in  the  work  of  McKendrick  in  1926  and 
in  subsequent  studies  by  von  Forester  in  1959.  The  equation  is  often  called 
the  McKendrick— von  Forester  equation.  Now  we  formulate  the  boundary 
condition  u(0,£),  the  number  of  newborns  at  time  t.  Because  u(a,t)da  is  the 
number  of  females  between  a  and  a  +  da,  and  b(a,  t)  is  the  average  reproduction 
rate,  the  number  of  offspring  produced  by  females  between  age  a  and  age  a  +  da 
is  b(a,  t)u{a ,  t)da.  Thus,  the  total  number  of  offspring  produced  by  all  females 
is  the  (sum)  integral  over  all  ages,  or 
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Figure  5.1  Age-structured  model:  /(a)  is  the  initial,  known  age  structure, 
and  u(0,t)  =  B(t)  is  the  unknown  offspring  at  age  a  =  0  and  time  t.  The 
age  structure  u(a,t)  for  a  >  t  is  affected  only  by  the  initial  population  /(a), 
whereas  for  a  <  t  it  is  affected  by  the  entire  population  and  its  fecundity;  ac,  is 
the  maximum  lifetime  age.  Individuals  follow  paths,  or  characteristics,  a  =  t- j- 
constant  in  age-time  space 


Therefore  we  have  derived  the  age-structured  model 

Ut  =  —  ua  —  m(a)u ,  a  >  0,  t  >  0,  (5.1) 

nOO 

u(0,t)=  /  b(a,t)u(a,t)da,  t  >  0,  (5-2) 

Jo 

u(a,  0)  = /(a),  a  >  0.  (5.3) 

We  emphasize  again  that  the  left  boundary  condition  at  age  a  =  0  is  not  known, 
but  rather  depends  upon  the  solution  u(a,  £),  which  is  also  unknown.  This  type 
of  condition  is  called  a  nonlocal  boundary  condition  because  it  depends 
upon  the  integrated  unknown  solution  in  the  problem. 

Age-structured  models  may  be  visualized  as  a  conveyor  belt  of  age  length 
0  <  a  <  cll  moving  at  speed  one  year,  per  year.  Individuals,  representing  a 
cohort,  enter  at  age  a  =  0  and  move  forward  along  the  belt  in  time.  As  they 
progress  they  are  subject  to  mortality,  and  they  reproduce  between  the  ages  of 
fertility,  am  <  a  <  clm>  Those  newborns  then  enter  the  belt  at  age  a  =  0. 

Stable  Age  Structure 

Rather  than  attempting  to  solve  (5. 1-5.3)  directly,  we  take  a  different  tack 
common  in  demographic  models.  We  ignore  the  initial  condition  (5.3)  and  ask 
what  happens  over  a  long  time.  Births  from  the  initial  population  /(a)  only 
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affect  the  solution  for  a  finite  time  because  those  individuals  and  their  offspring 
die.  Therefore,  in  the  case  that  the  maternity  function  is  independent  of  time, 
i.e. ,  b  =  6(a),  we  look  for  a  stable  age  structure  of  the  form 


u(a,t)  =  /7(a)e  ,  t  large 

where  /7(a)  is  an  unknown  age  structure  and  r  is  an  unknown  growth  rate. 
Substituting  into  the  PDE  (5.1)  and  making  reductions  gives  an  ODE  for  U (a), 

Ufa)  =  —(m(a)  +  r)U  (a). 

This  equation  can  be  solved  by  separation  of  variables  to  get 

U(a)  =Ce~rae-tim(B)dB, 

where  C  is  a  constant.  Letting  S(a)  =  exp  (  —  j("  m(s)ds)  denote  the  survivor- 
ship  function ,  which  is  the  probability  of  surviving  to  age  a,  we  can  write  the 
long  time  solution  as 

u(a,t)  =  Cert~raS(a).  (5.4) 

To  determine  the  growth  constant  r,  we  substitute  (5.4)  into  the  nonlocal 
boundary  condition  (5.2)  to  obtain 


•oo 


1  = 


b{a)e  raS(a)da , 


(5.5) 


o 


which  is  the  Euler— Lotka  equation.  Using  numerical  methods  this  equation 
can  be  solved  for  r,  and  we  will  have  determined,  up  to  a  constant,  the  long-time 
age  structure  U (a)  of  the  population  and  its  growth  rate  r. 

In  the  special  case  m  =  constant,  the  Euler-Lotka  equation  is 


1 


•oo 

b(a)e~l'r+m',ada. 


This  equation  determines  the  growth  rate  r  of  the  population.  The  exercises 
request  calculations  for  specific  population  data. 


The  Renewal  Equation 

The  method  of  characteristics  introduced  in  Section  1.2  may  be  used  to 
study  (5. 1-5.3)  in  the  simple  case  when  b  =  b(a)  and  m  =  constant.  The  PDE 
(5.1)  is 

Ut  =  —ua  —  mu ,  a  >  0,  t  >  0.  (5.6) 

If  we  change  independent  variables  via  the  formulae 


f  =  a-t,  r  =  t , 
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then  the  PDE  (5.6)  becomes 

UT  =  —mil,  where  U  =  U  (£,  t). 

This  equation  has  general  solution 

U(^T)  =  C(Oe~mT, 

where  C  is  an  arbitrary  function.  In  terms  of  the  original  variables, 

u(a,t)  =  C(a  — 

To  determine  the  arbitrary  function  C  we  consider  two  cases,  a  >  t  and  a  <  t. 
See  Figure  5.1.  The  arbitrary  function  is  different  in  each  case.  The  solution  in 
a  >  t  is  determined  by  the  initial  age  structure  and  we  have 

u(a,  0)  =  C(a)  =  /(a). 


Therefore 

u(a,t)  =  /(a  —  t)e~mt ,  a  >  t.  (5.7) 

For  a  <  t  we  denote,  for  simplicity,  B(t)  =  tz(0,  t).  Then,  applying  the  boundary 
condition  gives 

u(  0,  t)  =  B(t)  =  C(—t)e~rnt , 


or 

C{s)=B(-s)e-ms. 

Consequently 

u(a,t)  =  B(t  —  a)e-ma ,  a  <  t.  (5.8) 

The  solution  to  (5. 1-5.3)  in  the  case  m(a)  =  m  and  b{a,t)  =  b(a)  is  given 
by  (5. 7-5. 8),  but  B  is  still  unknown.  To  find  B  we  substitute  the  expressions 
(5. 7-5. 8)  into  the  yet  unused  nonlocal  boundary  condition  (5.2).  After  breaking 
up  the  integral  into  two  age  domains,  we  obtain 


*oo 


B(t) 


b(a)u(a ,  t)da 


o 


nt  POO 

=  /  b(a)u(a,  t)da  +  /  b(a)u(a,t)da, 

J  o  it 


or 


PL  POO 

B(t)  =  /  b(a)B(t  —  a)e~rnada  +  /  b(a)f(a  —  t)e~rntda.  (5.9) 

Jo  it 

Equation  (5.9)  is  a  linear  integral  equation  for  the  unknown  B(t),  and  it  is 
called  the  renewal  equation.  Once  it  is  solved  for  B(t ),  then  (5. 7-5. 8)  give 
the  age  structure  for  the  population.  In  Exercise  2  the  reader  is  asked  to  solve 
the  renewal  equation  in  a  special  case.  Generally,  (5.9),  a  nonhomogeneous 
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Volterra  equation,  is  difficult  to  solve  and  must  be  dealt  with  numerically,  or 
by  successive  approximation  (iteration). 

Structured  Predator-Prey  Model 

The  previous  discussion  illustrates  how  age  structure  leads  to  an  advection 
equation  that  can  be  solved,  in  theory,  by  standard  change  of  variable  methods. 
Now  we  consider  a  more  difficult  nonlinear  predator-prey  problem  where  a  dif¬ 
ferent  technique  is  illustrated.  The  method  is  called  the  method  of  moments, 
which  is  akin  to  an  energy  method.  We  can  add  this  important  technique  to 
our  analytic  tool  bag  for  dealing  with  PDEs. 

We  consider  a  population  of  prey  with  age  density  u(a,  t)  and  constant  per 
capita  mortality  rate  m.  Then,  as  above,  the  governing  age-time  dynamics  is 
given  by 

Ut  =  —ua  —  mu ,  a  >  0,  t  >  0,  (5.10) 

where  the  initial  number  of  prey  is 

u(a ,  0)  =  f(a).  (5-11) 


The  total  prey  population  is 


•  oo 


N(t)  =  I  u(a ,  t)da. 
o 


We  assume  that  the  maternity  function  is 


b{a )  =  b^ae 


—7  a 


Then  the  prey  produce  offspring  (eggs)  given  by 

rOO 

B(t)  =  /  b^ae~iau{ap)da. 

Jo 

Now  let  us  introduce  a  total  predator  population  P  =  P(t)\  we  shall  not  con¬ 
sider  age-structure  in  the  predator  population,  but  only  that  the  predators 
eat  the  eggs  of  the  prey.  (To  stimulate  thinking  about  this  model,  remember 
one  of  the  theories  posed  for  the  extinction  of  the  dinosaurs  is  that  egg-eating 
predators  caused  the  demise).  Because  predators  eat  only  eggs  (a  =  0),  the 
PDE  is  unaffected.  What  is  affected  is  the  actual  number  of  offspring  u( 0,  t) 
produced.  Thus  we  no  longer  have  u( 0,t)  =  B(t),  but  rather  we  must  include 
a  predation  term  that  decreases  the  egg  population.  The  simplest  model  is  the 
Lotka-Volterra  model,  which  requires  that  the  number  eggs  eaten  is  propor¬ 
tional  to  the  product  of  the  number  of  eggs  and  the  number  of  predators.  That 
is,  we  have 


u(0,t)  =  B(t )  —  kB(t)P(t ), 
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where  k  is  a  proportionality  constant.  But  the  right  side  can  be  negative,  so  we 
define  M(P,  P )  =  ma x(P  —  kBP ,  0)  and  take  the  number  of  eggs  at  a  =  0  to 
be 

u(0,t)  =  M(P,P).  (5.12) 

This  equation  provides  the  boundary  condition  to  the  problem.  Finally,  we  ask 
how  the  predator  population  changes.  We  assume  Lotka-Volterra  dynamics 

dP 

—  =  - 5P  +  cBP ,  (5.13) 

where  S  is  the  per  capita  mortality  rate  and  c  is  the  yield.  Hence,  in  the  absence 
of  eggs,  predators  die  out.  Initially,  we  take  P( 0)  =  P$. 

We  remark  that  if  the  predators  consumed  prey  other  than  eggs,  then  a 
predation  term  would  have  to  be  included  as  a  sink  term  on  the  right  side  of 
the  dynamical  equation  (5.10). 

The  method  of  moments  allows  us  to  obtain  a  system  of  ordinary  differential 
equations  for  the  total  prey  and  predator  populations  N(t)  and  P(t).  In  the 
analysis,  we  will  also  obtain  equations  for  some  additional  auxiliary  variables, 
but  the  overall  end  result  is  a  system  of  ODEs,  which  is  simpler  than  the  mixed 
PDE-ODE  system  given  by  the  model  equations  (5.10-5.13). 

The  idea  is  to  multiply  the  PDE  (5.10)  by  a  moment  function  g{a)  and 
then  integrate  over  0  <  a  <  oc.  The  only  requirement  is  that  u(a,t)g(a)  — >  0 
as  a  oo.  Upon  taking  g  to  be  different  functions,  we  can  obtain  equations 
that  lead  to  the  differential  equations  we  seek.  The  reader  will  find  it  valuable 
to  verify  these  calculations.  Proceeding  in  a  general  way,  we  multiply  the  PDE 
by  g  and  integrate  to  get 


d 

—  /  g(a)u(a,t)da  = 
dt  J  o 


•OO  POO 

g{a)ua(a,t)da  —  m  /  g(a)u(a,t)da. 

o  Jo 


The  first  integral  on  the  right  can  be  integrated  by  parts  to  get 

p  OO  P  OO  p  oo 

—  /  g(a)u(a,t)da  =  M(B,  P)g( 0)+ /  ^7(a)^(a,  t)da— m  /  g(a)u(a,t)da. 

dt  Jo  Jo  Jo 

(5.14) 

Now  we  make  different  choices  for  g.  If  g(a)  =  1  then  (5.14)  becomes  simply 


dN 

dt 


M[B ,  P)  -  mN , 


(5.15) 


an  ODE  involving  7V,P,  and  B.  If  we  take  g(a)  =  5(a),  the  maternity  function, 
then  (5.14)  becomes 


dB 


dt 


7 B  +  5qP  —  raP, 


(5.16) 
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where  H  =  H(t)  is  defined  by 

rOC 

H(t)  =  /  e~iau{a^t)da. 

J  o 

But  now  H  is  yet  a  new  variable.  To  get  another  equation  involving  H  we  take 
g(a)  =  e~ia .  Then  (5.14)  becomes 

d  M 

—  =  M{B,P)  -  (m  +  ~/)H.  (5.17) 

u/L 

Therefore  we  have  four  ODEs  (5.13),  (5.15-5.17)  for  IV,  P,  P,  and  H .  Clearly 
the  N  equation  decouples  from  the  system  and  we  can  just  consider  the  three 
ODEs 

<B-  =  -8P  +  cBP,  ^  =  -(m  +  j)B  +  b0H,  ^-=M(B,P)-(m  +  j)H. 
dt  dt  dt 

The  initial  conditions  are  P(0)  =  Pee  P(0)  =  /0°°  boae~1'CLf(a)da ,  P(0)  = 
J0°°  e~ia  f(a)da. 

We  can  now  proceed  with  a  numerical  method  (e.g.,  the  Runge-Kutta 
method)  to  solve  the  system  and  determine  the  dynamics.  The  reader  is  asked 
for  such  a  calculation  in  Exercise  4. 


EXERCISES 

1.  Consider  a  population  of  organisms  whose  per  capita  death  rate  is  three 
percent  per  month  and  that  the  maternity  function,  in  births  per  female 
per  age  in  months,  is  given  by  6(a)  =  4  for  3  <  a  <  8,  and  b(a)  =  0 
otherwise.  Use  the  Euler-Lotka  equation  to  calculate  the  growth  rate  r. 

2.  Consider  the  age-structured  model  (5. 1-5.3)  in  the  case  that  the  mortal¬ 
ity  rate  is  constant  (m(a)  =  7)  and  the  maternity  function  is  a  constant 
(6(a,t)  =  / 3 ).  At  time  t  =  0  assume  the  age  distribution  is  /(a)  =  uq  for 
0  <  a  <  S,  and  f(a)  =  0  for  a  >  S. 

(a)  Show  that  the  renewal  equation  (5.9)  takes  the  form 

B(t)  =  f  /3B(s)e~l(yt~s^ ds  +  /3Suoe~11:. 

J  0 

(b)  Show  that  B(t )  satisfies  the  differential  equation 

B'  =  (J3-  7 )B. 

Hint:  use  Leibniz’  rule  to  differentiate  the  integral. 

(c)  Determine  B(t)  and  the  population  density  u(a,t). 

(d)  What  is  the  total  size  N(t)  of  the  population  at  any  time  t? 
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3.  Consider  the  structured  model  where  the  per  capita  mortality  rate  depends 
upon  the  total  population  N  =  N(t )  and  the  maternity  function  is  b(a)  = 
boe~1CL  : 


'U't  —  Ua 


u{  0,  t) 


•  oo 


—  m(N)u ,  a  >  0,  t  >  0, 
b^e~iau{a^  t)da , 


u(a,  0) 


J  o 

/(a)» 


a  >  0. 


(a)  Use  the  method  of  moments  to  obtain  the  system  of  ODEs 

^  =  B  -  m(N)N,  (b0-  7  -  m(7V))  B, 

for  N(t)  and  the  offspring  B(t)  =  u( 0,  £).  (Note  that  the  maternity  function 
in  this  model  is  unreasonable  because  it  predicts  that  newborns  give  birth; 
but  it  may  be  a  good  approximation  when  the  females  reproduce  at  a  very 
young  age). 

(b)  Show  that  the  relation  B  =  —  7^  N  gives  a  solution  to  the  ODEs 

in  the  TViUplane. 

(c)  Show  that  the  solution  to  the  system  cannot  oscillate  and,  in  fact, 
approaches  a  steady  state. 

4.  Numerically  solve  equations  (5.13),  (5.15-5.17)  and  plot  the  prey  and 
predator  populations  N(t)  and  P(t)  for  0  <  t  <  125.  Take  &o  =  5  and 
the  remaining  constants  to  be  one.  Based  upon  your  calculation,  is  there 
a  basis  for  controlling  pests  by  introducing  predators  that  selectively  eat 
their  eggs? 

5.  Consider  an  age-structured  population  where  u  =  u(a,  t)  is  the  age  density 
of  people  at  age  a  at  time  t.  In  a  model  where  no  person  survives  past  age 
a  =  d,  the  PDE  for  u  is  given  by 

c 

Ut  +  ua  =  — - u ,  0  <  a  <  a,  t  >  0. 

a  —  a 

Interpret  this  model  and  use  the  methods  in  Chapter  1,  or  otherwise,  to 
find  the  general  solution.  Then  use  analytical  techniques  to  find  the  solution 
that  satisfies  the  initial  and  boundary  conditions 

u(a,  0)  =  /(a),  0  <  a  <  d;  u{ 0,£)  =  B(t ),  t  >  0, 

where  /(a)  is  the  initial  age  structure  and  B(t)  is  a  given  birth  schedule. 
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6.  This  exercise  presents  an  alternate  derivation  of  the  McKendrick-von 
Forester  equation  (5.1).  If  u(a ,  t)  is  the  density  of  females  of  age  a  at  time  £, 
then  at  a  small  time  dt  later,  all  females  still  alive  will  have  aged  an  amount 
da  =  dt.  Thus,  the  average  number  of  females  dying  in  this  interval  is 


u(a  +  da ,  t  +  dt)  —  u(a ,  t) 
dt 


— m(a)u(a ,  t)  +  0(dt ), 


where  m{a)u(a ,  t)dt  represents  the  probability  of  dying  in  the  small  time 
interval  t  to  t  +  dt.  Finish  the  derivation. 


5.2  Traveling  Waves  Fronts 

We  observed  in  Chapter  1  that  the  advection  equation  ut  =  —cux  and  the 
wave  equation  utt  =  c2uxx  both  admit  traveling  wave  solutions  of  the  form 
u(x,  t)  =  U (z),  where  U  is  any  function  (the  shape  of  the  wave),  and  z  =  x—ct  is 
a  moving  coordinate  that  travels  with  speed  c.  As  it  turns  out,  many  nonlinear 
PDEs  have  these  types  of  solutions.  They  model  a  signal,  or  disturbance,  that 
moves  with  constant  speed,  and  they  can  represent  moving  population  fronts, 
infectious  disease  or  epidemic  waves,  biological  invasions,  chemical  reaction 
fronts,  and  many  other  interesting  biological  phenomena. 

First  consider  the  model  diffusion- type  equation  having  the  general  form 

Ut  —  Duxx  T  f^u^uxj^  (5.18) 

where  /  is  some  fixed  function,  possibly  nonlinear,  depending  the  concentration 
u  and  its  spatial  gradient  ux.  We  are  interested  in  finding  solutions  of  the  form 
u(x,t)  =  U(z),  where  z  =  x  —  ct,  where  both  the  wave  form  U  and  the  wave 
speed  c  are  unknown.  The  spatial  independent  variable  x  is  assumed  to  vary 
over  all  real  numbers,  that  is,  —  oo  <  x  <  oo,  and  therefore  —  oo  <  z  <  oo.  We 
can  find  an  equation  for  the  shape  of  the  wave  U (z)  by  substituting  this  form 
into  the  PDE.  First,  we  calculate  the  partial  derivatives  of  u  in  order  to  make 
the  substitution.  Using  the  multi-variable  chain  rule,  we  get 

ut  =  U\z)zt  =  -cU\z), 
u x  —  U  (z)zx  —  U  (^z) , 

UXX  =  U"{z)zx  =  U’\z). 

Substituting  into  the  PDE  produces  an  ODE  for  the  wave  form  U(z): 

-cU'  =  DU"  +  f(u,U'), 


—  OO  <  Z  <  OO. 


(5.19) 
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Notice  that  the  ODE  has  only  two  variables,  the  independent  “moving”  coor¬ 
dinate  z  and  the  dependent  wave  form  U.  The  variables  t  and  x  dropped  out  of 
the  problem,  which  is  required  if  the  method  is  to  succeed.  The  constant  wave 
speed  is  not  known. 

Ordinary  differential  equations  are  sometimes  easier  to  deal  with  than 
PDEs.  In  the  present  case  we  have  a  second-order  ODE  where  the  independent 
variable  is  any  real  number.  Solutions  of  such  equations  usually  depend  upon 
two  arbitrary  constants,  and  we  need  two  auxiliary  conditions  to  determine 
those  constants.  Therefore  we  impose  conditions  at  z  =  Too  on  the  wave  form, 
namely  that  U ( z )  approach  constant  values,  or  densities,  at  both  extremes: 

£/(— oc)=iq,  U(+oc)  =  ur ,  (5.20) 

where  ui  and  ur  are  two  fixed  constants.  We  assume  these  two  states  are  equi¬ 
librium  states  for  the  equation,  i.e.,  0^  =  f(ur,  =  0.  Thus  we  are  not 

interested  in  any  traveling  wave,  but  only  those  that  are  bounded  and  approach 
constant,  equilibrium  states  at  z  =  Too.  These  special  types  of  traveling  wave 
solutions  are  called  wave  front  solutions.  See  Figure  5.2. 


u 


r 


x 


Figure  5.2  Plot  of  two  successive  wave  profiles  of  a  right,  traveling  wave  front 
solution  moving  at  speed  c 


To  complicate  matters,  the  wave  speed  c  is  also  unknown  and  often  acts  as 
an  eigenvalue ;  that  is,  wave  front  solutions  will  exist  only  for  certain  values  of 
c.  Hence,  we  often  refer  to  (5.19-5.20)  as  a  nonlinear  eigenvalue  problem. 

Wave  fronts  can  arise  from  natural  settings  as  limiting,  or  long  time,  solu¬ 
tions  to  boundary  value  problems.  As  an  illustration,  consider  the  initial  BVP 
for  a  toxic  chemical  of  concentration  u  =  u(x,  t )  in  a  semi-infinite  domain,  e.g., 
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a  long  canal: 

Ut  =  Duxx  +  f(u,uxj,  x  >  0,  t  >  0, 

u(x,  0)  =  0,  x  >  0, 
u( 0,  t)  =  1,  t  >  0, 
u(  oo,  t)  =  0. 

Here  /  is  a  term  that  contains  advection  and  reaction  processes.  Intuitively,  we 
reason  as  follows.  For  early  times  a  concentration  wave  with  unit  magnitude 
at  the  left  boundary  (x  =  0)  begins  to  move  into  the  medium;  at  these  times 
the  shape  of  the  wave  changes  as  it  evolves.  However,  over  a  longer  time  the 
concentration  wave  can  begin  to  approach  a  wave  front  with  the  same  shape 
wave  form.  That  is,  the  solution  to  the  initial  BVP  may  approach  a  wave  front 
having  the  form  u  =  F{pc  —  ct)  for  some  wave  speed  c.  Often  we  just  look  for 
the  wave  front.  The  issue  of  whether  solutions  to  initial  BVPs  evolve  into  wave 
fronts  is  a  question  of  whether  the  wave  front  attracts  other  solutions,  i.e.,  it 
is  one  of  stability  of  the  front. 

Generally  we  cannot  find  an  explicit  solution  to  (5.19)  and  we  resort  to 
numerical  computation.  Oftentimes  we  can  prove  there  must  be  a  unique  solu¬ 
tion  without  solving  the  equation  in  any  manner.  A  standard  technique  is  to 
transform  (5.19)  to  a  system  of  first  order  ODEs  for  the  variables  U  and  V  —  U' 
in  the  UV  phase  plane.  We  obtain 

U'  =  V, 

V'  =  ~v-l-fmv). 

D  D  y  J 

Assuming  that  V,  or  equivalently  U',  goes  to  zero  at  z  =  Too,  the  problem 
of  finding  a  wave  front  solution  to  (5.19)  now  consists  of  finding  a  solution 

(trajectory)  in  phase  space  that  connects  the  two  equilibrium  points  ^iq,0^ 

and  ^r,0^.  A  thorough  discussion  of  the  phase  space  method  is  beyond  our 

scope  and  we  refer  the  reader  to  other  sources  (e.g.,  Logan  2001,  2009,  or  the 
references  given  at  the  beginning  of  this  chapter). 

Example  5.1 

Consider  the  diffusion  equation 


u+  =  Duxx. 

Solutions  of  the  form  u{x,  t)  =  U (z),  where  z  =  x  —  ct,  must  satisfy  the  ODE 

-cU'  =  DU". 
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This  is  a  linear  second-order  equation  with  characteristic  equation  —cm  =  Dm 2 
having  roots  m  =  0,  m  =  —c/D.  Therefore  the  independent  solutions  are  1  and 
e-cz/D ^  anq  the  general  solution  is  therefore 

U  (z)  =  ci  +  c2e~czlD , 


or 

u(x,t)  =  c1+c2e-c(x-ct)/D. 

Therefore  the  diffusion  equation  admits  traveling  wave  solutions,  but  not  (non¬ 
constant)  wave  front  solutions  because  e~cz^D  does  not  remain  bounded  at 
z  =  Too.  □ 

Example  5.2 

(Epidemic  waves)  We  consider  a  model  for  the  spread  of  a  rabies  epidemic 
throughout  a  healthy  population  of  foxes.  We  set  up  the  problem  in  one¬ 
dimensional  linear  geometry,  and  we  ignore  natural  mortality  and  births.  Let 
S  =  S(x,  t )  denote  the  density  of  the  susceptible  fox  population  and  I  =  I (x,  t) 
the  density  of  the  infected  foxes.  We  assume  the  rate  that  healthy  foxes  become 
infected  is  proportional  to  5/,  the  argument  being  that  the  number  of  contacts 
between  members  of  the  two  groups  is  equal  to  their  product;  thus  the  rate  of 
conversion  of  susceptibles  to  infecteds  is  bSI,  where  b  is  the  constant  infection 
rate,  or  the  fraction  of  the  contacts  that  lead  to  infection.  Because  rabies  is  fatal, 
we  assume  that  infected  foxes  die  at  per  capita  rate  r  and  they  are  removed 
from  the  susceptible  and  infected  populations.  Finally,  we  assume  that  healthy 
foxes  are  territorial  and  do  not  diffuse,  while  rabid  foxes  are  disoriented  and 
diffuse  randomly  with  diffusion  constant  D.  Thus  we  have  the  model  equations 

St  =  ~ bSI , 

It  =  bSI  -rl  +  DIXX. 

By  an  epidemic  wave  we  mean  a  wave  of  infecteds  moving  into  a  healthy  pop¬ 
ulation,  and  we  model  it  by  a  wave  front  of  unknown  wave  speed  c.  Letting 
S  =  S(z),  I  =  /(z),  z  =  x  —  cb,  we  get  wave  front  equations 

—cS'  =  - bSI , 

-cl'  =  bSI  -rl  +  DI". 

At  z  =  Too,  ahead  of  the  infection  or  epidemic  wave,  we  assume  there  are  only 
susceptibles,  so  S(+ oo)  =  TV,  where  N  is  the  total  population  of  foxes,  and 
/(+oo)  =  0.  After  the  wave  has  passed,  at  z  =  —  oo,  we  assume  no  infecteds 
remain,  or  /(— oo)  =  0  (the  epidemic  has  died  out),  and  the  number  of  suscep¬ 
tibles  is  S(— oo)  =  A/,  which  is  not  known  a  priori.  We  want  to  determine  the 
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speed  c  of  the  epidemic  wave  and  Si,  the  number  that  will  not  get  the  disease. 
Figure  5.3  shows  the  shapes  of  the  waves  that  we  might  expect. 


Figure  5.3  SI  epidemic  wave  S  =  S(x  —  ct) 


To  make  progress,  note  that  S' / S  =  (In S')7,  so  the  first  equation  becomes 
I  =  (c/b)  (In  S)' .  We  can  substitute  this  into  the  second  equation  to  obtain 


rc 


cl'  =  cS'  -  —  (In  S)' +  £>/", 

b 


which  can  be  integrated  with  respect  to  z  since  every  term  is  a  derivative.  Then 


rc 


-cl  =  cS  -  —  In  S  +  DI'  +  a, 

b 


where  a  is  a  constant  of  integration.  Taking  the  limit  as  z  Too  gives  a  = 
c(^  In  TV  —  .  Taking  the  limit  as  z  —>  — oo  gives  another  equation  relating  a 

and  c,  namely  a  =  c  (  ^  In  Si  —  Sn .  Therefore 


y  \n N  -  N  =  j  \n  Si  —  Si,  (5.21) 

b  b 

which  is  a  relation  between  the  number  Si  that  do  not  get  the  disease,  the  total 
initial  population  TV,  the  infection  rate  6,  and  the  death  rate  r.  The  relation  is 
independent  of  the  epidemic  wave  speed  c.  To  simplify,  we  introduce  parameters 


r 


where  F  is  the  fraction  that  do  not  contract  rabies  and  Ro  is  the  reproductive 
ratio  of  the  disease,  representing  the  death  rate  divided  by  the  infection  rate. 
Then  (5.21)  can  be  written  in  the  form 


F  —  1 
In  F  ’ 


0  <  F  <  1. 


A  plot  of  this  function  is  shown  in  Figure  5.4,  from  which  we  infer  that  Rq 
cannot  exceed  1.  Thus,  no  epidemic  wave  front  can  occur  when  Rq  >  1.  This  is 
reasonable  since  the  death  rate  would  be  high  compared  to  the  rate  of  getting 
the  disease;  the  epidemic  dies  out  because  the  infecteds  die  out  faster  than  they 
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are  replaced.  This  case  can  occur  if  b  is  small  or  can  be  forced  small,  say,  by 
inoculation.  On  the  other  hand,  if  Ro  <  1  then  a  epidemic  wave  can  propagate; 
the  smaller  Rq?  the  greater  the  number  that  get  rabies.  □ 


Several  questions  remain  in  this  example.  We  did  not  solve  the  wave  front 
equations  to  find  the  wave  forms  or  even  show  that  such  wave  forms  exists; 
nor  did  we  determined  the  speed  of  the  epidemic.  The  goal  has  been  to  derive 
certain  properties  of  the  epidemic  wave  under  the  assumption  that  it  exists.  A 
more  thorough  analysis,  given  in  the  references,  shows  that  such  solutions  do 
exist. 


EXERCISES 

1.  Show  that  the  nonlinear  advection-diffusion  equation 

u+  =  Duxx  —  uux 

admits  a  wave  front  solution  U  =  U(z),  satisfying  the  boundary  conditions 
U(—oo)  =  1,  U(+oo)  =  0.  Find  the  wave  speed  and  sketch  the  the  solution 
on  a  zu- coordinate  system  if  U(0)  = 

2.  Consider  a  reaction-diffusion  equation  with  a  nonlinear  advection  term 
given  by 

u+  =  uxx  —  v?ux. 

For  which  wave  speeds  c  does  a  positive  wave  front  solution  exist  with 
u  0  as  x  oo?  What  are  the  possible  states  at  x  =  —  oo? 
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3.  Consider  the  system  of  reaction-advection-diffusion  equations 

tq  —  Duxx  ^ux  aF(u,  v)^ 
vt  =  —bF(u,  v), 

where  a,  6,  D ,  and  7  are  positive  constants.  Find  a  system  of  ODEs  for 
traveling  wave  forms  u  =  U (z),  v  =  V (z),  z  =  x  —  ct.  If  boundary  conditions 
are  given  by 

U(—oo)  =  ui,  Z7(+oc)  =  0,  V  (—00)  =  0,  V  (+00)  =  vr, 


what  conditions  must  the  reaction  term  F  satisfy  for  wave  front  solutions 
to  exist?  Sketch  possible  wave  front  profiles.  Assuming  wave  fronts  exist, 
show  that  the  speed  c  of  the  wave  is  less  than  the  advection  speed  7. 

4.  The  following  system  of  PDEs  arises  in  the  study  of  bioremediation  of 
aquifer  systems  where  immobile,  indigenous  microbes  attached  to  the  soil 
are  stimulated  to  consume  a  contaminant  and  produce  nontoxic  products 
(see  Logan  2001,  p.  107): 

RSt  =  -vSx-  F,  At  =  -vAx-  rF,  Mt  =  yF  -  b (m  -  M0) , 


where  M(x,t)  is  the  density  of  the  microbes,  S(x,t)  is  the  density  of  the 
contaminant  (e.g.,  a  hydrocarbon),  A(x,  t)  is  the  density  of  a  nutrient  stim¬ 


ulant  (e.g.,  oxygen),  and  F  =  qSAM/ 


Ks  +  S )  { Ka  +  A 


is  the  biodegra¬ 


dation  rate.  R  >  1  is  the  retardation  constant,  v  is  the  average  velocity  of 
the  subsurface  flow,  b  is  the  decay  rate  of  the  bacteria,  y  is  the  yield,  Mq  is 
a  reference  microbe  density,  and  r  is  the  mass  of  the  nutrient  used  per  mass 
contaminant  degraded;  <7,  iLs,  and  Ka  are  rate  constants.  Find  the  speed 
c  of  an  assumed  wave  front  that  satisfies  the  boundary  conditions  S  =  0, 
A  =  Aq  at  —00,  S  =  5r,  A  =  0  at  +00,  and  M  =  Mq  at  ±00.  Without 
solving  the  wave  front  differential  equations,  sketch  anticipated  profiles  of 
5,  A,  and  M  as  a  function  of  the  variable  z  =  x  —  ct. 


5.  The  nonlinear  PDE  model 


^(1  +  b)u  —  mw2j  =  uxx  —  ux 

arises  in  the  subsurface  transport  of  a  contaminant  that  is  adsorbed  to  the 
soil.  Assume  b  >  m.  Find  a  wave  front  solution  u  —  U(x  —  ct)  satisfying 
the  conditions  U(—oo)  =  1,  f7(+oo)  =  0,  U( 0)  =  0.5.  The  solution  is 

1 


u(x,  t ) 


_|_  gcm(x-ct)  ' 


5.3  Equilibria  and  Stability 


245 


5.3  Equilibria  and  Stability 


We  already  noted  the  broad  occurrence  of  diffusion  problems  in  biological  sys¬ 
tems.  Now  we  investigate  another  aspect  of  such  problems,  namely  the  persis¬ 
tence  of  equilibrium  states  in  systems  governed  by  reaction-diffusion  systems. 
At  issue  is  the  stability  of  those  states:  if  a  system  is  in  an  equilibrium  state 
and  it  is  slightly  perturbed,  or  displaced,  does  the  system  return  to  that  state, 
or  does  it  evolve  to  a  completely  different  state? 

Underpinned  by  the  seminal  work  of  Alan  Turing  in  1952  on  the  chemical 
basis  of  morphogenesis,  it  has  been  shown  in  extensive  research  that  diffusion 
induced  instabilities  can  give  rise  to  spatial  patterns  in  all  sorts  of  biological 
systems.  Reaction-diffusion  models  have  been  used  to  explain  the  evolution  of 
form  and  structure  in  developmental  biology  (morphogenesis),  tumor  growth, 
ecological  spatial  patterns,  aggregation,  patterns  on  animal  coats,  and  many 
other  processes  in  molecular  and  cellular  biology. 

In  this  section  we  introduce  the  basic  idea  of  stability  in  reaction-diffusion 
models  and  we  observe  that  such  systems  can  have  instabilities  that  lead  to 
density  variations  and  patterns.  We  refer  the  reader  to  Murray  (2003)  for  an 
in-depth  treatment  of  these  models  in  the  life  sciences. 


Stability 


To  illustrate  the  notion  of  stability  of  an  equilibrium,  or  steady-state,  to 
a  PDE,  we  begin  with  a  review  of  the  stability  issue  for  ODEs.  For  example, 
consider  the  logistics  population  model 


du(t) 

dt 


ru(t) 


(5.22) 


where  r  is  the  intrinsic  growth  rate  and  K  is  the  carrying  capacity,  both  pos¬ 
itive.  A  steady-state,  or  equilibrium  solution ,  is  a  constant  solution  u  =  ue. 
For  such  a  solution  the  left  side  of  (5.22)  must  be  zero,  and  so  ue  satisfies  the 
algebraic  equation 

rue (1  -  -^0  =  0. 

Therefore  there  are  two  equilibria,  ue  =  0  and  ue  =  K.  If  at  some  time  t  the 
system  is  in  an  equilibrium  state,  it  remains  in  that  state  for  all  time.  That, 
if  u(to)  =  ue  for  some  to,  then  u(t)  =  ue  for  all  t  >  to  because  u(t)  =  ue 
satisfies  both  the  initial  condition  and  the  differential  equation;  uniqueness 
implies  this.  Natural  “perturbations,”  however,  often  disturb  equilibrium  and 
move  the  system  a  small  amount  away  from  its  steady  state.  Does  the  system 
return  to  that  state,  or  does  it  deviate  significantly  from  that  state,  possible 
going  to  another  equilibrium  state?  This  is  the  question  of  stability.  In  the 
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present  example,  if  the  system  is  at  carrying  capacity  u  =  K  and  a  small 
number  of  organisms  are  removed,  does  the  system  grow  back  to  that  original 
state  or  does  it  do  something  else?  To  answer  this  question  we  try  to  determine 
how  a  small  deviation,  or  perturbation ,  will  evolve  by  finding  an  equation  for 
the  deviation  and  solving  it.  Therefore,  let  U (t)  be  a  small  deviation  from  the 
steady  state  u  =  K.  That  is,  assume  u(t)  =  K  +  U(t).  Then  u(t)  must  still 
solve  the  model  equation.  Substituting  into  (5.22)  gives 

dM±mi  =  r(K  +  m)  (i  _ *  +  m  \ . 

dt  \  K  J 


Simplifying  the  right  side  and  noticing  that  dK/dt  =  0  gives 

dU  (t) 


dt 


=  ~{KU(t)  +  U(t) 2), 


(5.23) 


which  is  called  the  perturbation  equation.  In  general  we  cannot  always 
solve  the  perturbation  equation,  so  we  make  another  argument.  In  (5.23)  the 
perturbation  U(t)  is  small;  therefore  the  U(t )2  term  is  very  small  compared  to 
the  U (t)  term  and  we  can  discard  the  it  to  obtain  a  linearized  perturbation 
equation 

dU(t) 


dt 


-rint). 


This  equation  can  be  solved  instantly  to  obtain  U(t)  =  Ce~rt ,  which  shows 
that  the  perturbations  decay,  as  long  as  they  remain  small. 

This  linearization  procedure  can  be  applied  to  any  nonlinear  autonomous 
equation 

rhi 

-  /(u),  U  =  u(t). 


dt 


Equilibrium  solutions  u  =  ue  are  solutions  of  the  algebraic  equation 


/(«e)  =  0. 


(5.24) 


If  u(t)  =  ue-\-U(t),  where  U  =  U(t)  is  a  small  deviation  from  equilibrium,  then 
it  satisfies  the  perturbation  equation 

dU 


dt 


=  f(ue  +  U ) 


Because  U  is  assumed  small,  we  may  expand  the  right  side  in  its  Taylor  series 
about  ue.  Thus 

/  i^e  +  U)  =  /  {u^j  +  f'  U  +  ~f"  {^Vej  U2  +  *  •  •. 

Discarding  the  higher  order  nonlinear  terms,  while  using  (5.24),  we  obtain  a 
linearized  perturbation  equation 

dU 


dt 


=  f'(ueSjU. 


5.3  Equilibria  and  Stability 


247 


This  equation  has  general  solution  U(t)  =  CeAt,  where  A  =  f'(ue )  is  termed 
the  eigenvalue ;  if  A  <  0  the  perturbation  decays  and  the  equilibrium  ue  is 
stable,  and  if  A  >  0  the  perturbation  grows  and  the  equilibrium  is  unstable. 
If  A  =  0  there  is  no  information  and  we  have  to  take  into  account  additional 
terms  in  the  Taylor  series.  In  summary,  the  stability  of  a  equilibrium  state 
can  be  determined  by  examining  the  sign  of  the  eigenvalue,  i.e.,  the  sign  of 
the  derivative  of  the  right  side  of  the  differential  equation,  evaluated  at  that 
equilibrium  state. 

The  linearization  procedure  is  based  upon  the  assumption  that  the  pertur¬ 
bations  are  sufficiently  small.  The  type  of  stability  that  results  is  called  local 
stability  since  the  deviations  are  not  far  from  the  equilibrium  state.  The  pro¬ 
cedure  gives  no  results  about  global  stability ,  i.e.,  how  the  system  would  evolve 
if  any  perturbation,  regardless  of  its  magnitude,  were  permitted. 


Example  5.3 

In  the  logistics  equation  (5.22)  the  right  side  is 


Therefore, 


2  r 

=  r~KU ' 


To  determine  local  stability  of  the  equilibrium  state  ue  =  K  we  check  the 
eigenvalue 

2r 

A  =  f{K)  —  r  — —K  —  —r. 


Because  A  <  0  the  equilibrium  ue  =  K  is  stable;  small  perturbations  decay 
away.  The  other  equilibrium  solution  is  ue  =  0.  In  this  case  A  =  /'( 0)  =  r  > 
0,  and  therefore  the  zero  state  is  unstable;  if  an  initial  small  deviation  were 
imposed,  it  would  grow  and  the  system  would  depart  from  equilibrium.  □ 


Stability  for  PDEs 

A  similar  theory  of  local  stability  of  equilibrium  solutions  can  be  developed 
for  PDEs.  In  the  interval  0  <  x  <  L  consider  the  reaction-diffusion  equation 

'Ut  —  DuXx  T  f  0? 

with  no-flux  boundary  conditions  ux(0,t)  =  ux(n,t)  =  0.  Let  u(x,t)  =  ue 
be  a  constant  equilibrium  solution  (so  that  /(ne)  =  0).  To  fix  the  idea  let 
a  =  f(ue)  >  0.  Note  that  the  equilibrium  solution  satisfies  the  PDE  and 
the  boundary  conditions.  Next,  let  U(x,t)  be  a  small  perturbation  from  the 
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equilibrium  solution,  or  u(x,t)  =  ue  +  U{pc,t).  The  perturbation  equation  is 
determined  by  substituting  this  expression  into  the  PDE  and  the  boundary 
conditions: 


Ut  —  DU xx  +  / (ue  +  C/) ,  Ux( 0,  £)  —  UX(L,  t)  —  0. 

To  linearize  this  equation  we  expand,  as  above,  the  right  side  in  a  Taylor  series 
and  discard  the  nonlinear  terms: 


f(ue  +  U)=  f(ue)  +  f'(ue)u  +  •  •  •  =  f'(ue)U  +  ■■■ 

Then  we  obtain  the  linearized  perturbation  equation 

Ut  =  DUXX  +  aU,  a  =  /'(we)  >  0  (5.25) 

subject  to  boundary  conditions 

£4(0,  t)  =  UX(L,  t)  =  0.  (5.26) 

This  problem,  which  is  on  a  bounded  interval,  can  be  solved  by  the  sep¬ 
aration  of  variables  method  (Chapter  4).  We  assume  U  =  g(x)h{t)  and  then 
substitute  into  the  PDE  (5.25)  and  boundary  conditions  (5.26)  to  obtain 

gh!  =  Dg"h  +  agh , 


or 


Then  h  =  Cext  and 


hf_ 

~h 


Dg" 

g 


T  a  —  A. 


9" +  (~D~)  9  =  °’  =  9 =  °' 


This  equation  cannot  have  nontrivial  exponential  solutions  that  satisfy  the 
boundary  conditions;  therefore  a  —  A  >  0.  In  the  case  A  =  a  we  obtain  a 
constant  solution,  and  in  the  case  a  —  A  >  0  we  obtain 


i  a  —  A  la  —  A 

g(x)  =  A  sm  \  j  — — — x  +  B  cos  \l  — rr — x. 


D 


D 


Applying  g'( 0)  =  0  forces  A  =  0;  then  g'(L)  =  0  implies 


or 


a  —  Ar 

sin  V  — L  =  0 


a 


A 


D 


L 


n7r,  n  =  1,2, ... 
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Therefore,  incorporating  the  case  A  =  a,  we  obtain  eigenvalues 

Dr?  7T2 


A  =  A  n  =  a 


L2 


n  =  0,1,  2, ... 


(5.27) 


The  modal  solutions  are  therefore 


Un(x,  t)  =  eXnt  cos  — —  x,  n  —  0,1,2,... 

Jj 


Tin 


(5.28) 


What  conclusions  can  we  draw  from  this  calculation?  Because  the  general 
solution  U(x,t)  to  the  boundary  value  problem  (5.25-5.26)  is  a  linear  combi¬ 
nation  of  the  modal  solutions  (5.28), 


oo 

U (x,  t)  =  coeat  +  cneXnt  cos 

71=1 

it  will  decay  if  all  of  the  modes  decay,  and  it  will  grow  if  one  of  the  modes 
grow.  The  constants  cn  are  determined  by  the  initial  perturbation.  The  spatial 
part  of  the  modal  solutions,  i.e.,  the  cosine,  remains  bounded.  The  amplitude 
factor  eAnt,  and  therefore  the  eigenvalues  An,  determine  growth  or  decay.  Let 
us  examine  the  modes.  In  the  case  n  =  0  the  eigenvalue  is  A n  =  a  and  the 
modal  solution  is  coeat,  which  grows  exponentially.  In  fact,  any  mode  satisfying 
a  >  L2W  is  unstable.  Therefore,  instabilities  are  likely  to  occur  for  modes  of 
low  frequency  (n  small),  or  in  systems  of  large  size  L  or  low  diffusion  properties 
D.  Oppositely,  small  systems  with  large  diffusion  constants  are  stabilizing,  as 
are  the  high  frequency  modes.  In  the  most  general  case  the  initial  perturbation 
contains  all  modes  and  the  steady  state  is  locally  unstable. 


Cell  Aggregation 

We  apply  these  ideas  to  a  problem  in  cell  aggregation.  A  slime  mold  popula¬ 
tion  is  a  collection  of  unicellular  amoeboid  cells  that  feed  on  bacteria  in  the  soil. 
When  the  food  supply  is  plentiful,  the  bacteria  are  generally  uniformly  spatially 
distributed  throughout  the  soil;  but  as  the  food  supply  becomes  depleted  and 
starvation  begins,  the  amoeba  start  to  secrete  a  chemical  (cyclic  AMP)  that 
acts  as  an  attractant  to  the  other  amoeba  and  aggregation  sites  form.  The  rest 
of  the  story  is  even  more  interesting  as  the  aggregation  sites  evolve  into  slugs 
that  ultimately  develop  into  a  sporangiophores  consisting  of  a  stalk  and  head 
containing  new  spores.  The  spores  are  released  and  the  process  begins  anew. 
We  are  interested  here  in  only  the  first  part  of  this  complicated  problem,  the 
onset  of  aggregation.  We  work  in  one  spatial  dimension. 

Let  a  =  a(x,  t)  and  c  =  c(x,  t)  denote  the  density  and  concentration  of  the 
cellular  amoeba  and  cyclic  AMP,  respectively.  The  fundamental  conservation 
laws  are 

at  =  ct  =  -<j>ic)  +  F, 
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where  q ^  and  q ^  are  the  fluxes  of  the  amoeba  and  the  chemical,  respectively. 
There  are  no  source  terms  in  the  amoeba  equation  because  we  do  not  consider 
birth  and  death  processes  on  the  time  scale  of  the  analysis.  The  source  term  F 
in  the  chemical  equation  consists  of  two  parts:  production  by  the  amoeba  and 
degradation  in  the  soil.  We  assume  the  chemical  is  produced  by  the  amoeba  at 
a  rate  proportional  to  the  density  of  the  amoeba,  and  the  chemical  degrades  at 
a  rate  proportional  to  its  concentration;  i.e., 

F  =  fa  —  kc. 

The  chemical  moves  by  diffusion  only,  and  we  assume  Field s  law: 

<y)  =  -Sex, 

where  S  is  the  diffusion  constant.  The  amoeba  are  also  assumed  to  randomly 
diffuse,  but  there  is  another  flux  source  for  the  amoeba,  namely  attraction  to  the 
chemical.  We  assume  this  attraction  is  up  the  chemical  gradient ,  toward  high 
concentrations  of  c.  Additionally,  it  should  depend  on  the  amoeba  population 
because  that  increases  the  magnitude  of  the  chemical  concentration  released. 
This  type  of  flow  toward  a  source,  induced  by  chemical  gradients,  is  called 
chemotaxis.  Therefore  we  assume 

=  random  flux  +  chemotatic  flux 

—  frax  T  vaCx-, 

where  p  is  the  amoeba  motility  and  v  is  the  strength  of  the  chemotaxis,  both 
assumed  to  be  positive  constants.  Note  that  the  random  flux,  having  the  form 
of  Fick’s  law,  has  a  negative  sign  since  flow  is  “down  the  gradient”  (from  high 
to  low  densities),  and  the  chemotatic  flux  has  a  positive  sign  since  that  term 
induces  flow  “up  the  chemical  gradient”  (from  low  to  high  concentrations). 
Putting  all  these  equations  all  together  gives  the  system 

at  —  h^xx  ^ 5  Q;  —  3cXx  T  f &  kc.  (5.29) 

Both  a  and  c  satisfy  no-flux  boundary  conditions,  i.e.,  ax  =  cx  =  0  at  x  =  0,  L, 
which  means  there  is  no  escape  from  the  medium. 

Notice  that  there  will  be  an  equilibrium  solution  a  =  a,  c  =  c  to  (5.29) 
provided 

fa  =  kc. 

That  is,  the  production  of  the  chemical  equals  its  degradation.  This  equilibrium 
state  represents  the  spatially  uniform  state  in  the  soil  before  aggregation  begins. 
To  determine  the  local  stability  of  this  state  we  let 

a  =  a  +  A(x,t),  c  =  c  +  C(x,  £), 


5.3  Equilibria  and  Stability 


251 


where  A  and  C  are  small  perturbations.  Substituting  these  quantities  into  (5.29) 
gives,  after  simplification,  the  perturbation  equations 

At  =  (iAxx  —  v((cl  +  A)C^j  ,  Ct  =  $CXX  +  f A  —  kC. 

These  equations  are  nonlinear  because  of  the  ACX  term  in  the  amoeba  equation. 
If  we  discard  the  nonlinear  term  on  the  assumption  that  the  product  of  small 
terms  is  even  smaller,  then  we  obtain  the  linearized  perturbation  equations 

At  =  fiAxx  —  vaCxx ,  Ct  =  SCXX  +  f  A  —  kC.  (5.30) 

Easily  one  can  see  that  the  perturbations  satisfy  the  no-flux  boundary  condi¬ 
tions. 

Motivated  by  our  knowledge  of  the  form  of  solutions  to  linear  equations,  we 
assume  there  are  modal  solutions  of  the  form 

A(x,  t)  =  c\eat  cosrx,  C(xA)  =  C2eatcosrx,  (5.31) 

where  r  and  cr  are  to  be  determined,  and  c\  and  C2  are  constants.  Notice  the 
form  of  these  assumed  solutions.  The  spatial  part  is  bounded  and  periodic 
with  frequency  r  and  period  27r/r,  and  the  temporal  part  is  exponential  with 
growth  factor  cr,  which  may  be  a  real  or  complex  number.  If  cr  is  negative  or 
has  a  negative  real  part,  then  the  perturbation  will  decay  and  the  equilibrium 
state  will  return  (stability);  if  cr  is  positive  or  has  positive  real  part,  then  the 
perturbations  will  grow  and  the  equilibrium  will  be  unstable. 

Let  us  ask  why  solutions  of  (5.30)  should  be  of  this  form  (5.31)  without 
going  through  the  entire  separation  of  variables  argument.  Equations  (5.30) 
are  linear  with  constant  coefficients,  and  both  equations  contain  A  and  C  and 
their  derivatives.  If  we  are  to  have  a  solution,  then  all  the  terms  must  match 
up  in  one  way  or  another  in  order  to  cancel.  So  both  A  and  C  must  essentially 
have  the  same  form.  Because  there  is  a  first  derivative  in  £,  the  time  factor 
must  be  exponential  so  it  will  cancel,  and  the  spatial  part  must  be  a  sine  or  a 
cosine  because  of  the  appearance  second  spatial  derivative.  We  anticipate  the 
cosine  function  because  of  the  no-flux  boundary  conditions,  as  in  Chapter  4.  If 
we  substitute  (5.31)  into  (5.30)  we  obtain  the  two  equations 

(a  +  /rr2^ci  —  var2C2  =  0,  —  fc\  +  (a  +  k  +  Sr2^jc2  =  0 

which  relate  all  the  parameters.  We  may  regard  these  two  equations  as  two 
linear,  homogeneous  for  the  constants  c\  and  C2-  If  we  want  a  nontrivial  solution 
for  c i  and  C2,  then  from  matrix  theory  the  determinant  of  the  coefficient  matrix 
must  be  zero.  That  is, 

(cr  +  /rr2^  (cr  k  +  Sr2^j  —  fvar 2  =  0, 
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which  is  an  equation  relating  the  temporal  growth  factor  cr,  the  spatial  fre¬ 
quency  r,  and  the  other  constants  in  the  problem.  Expanded,  this  equation  is 
a  quadratic  in  cr, 

cr2  +  71  a  +  72  =  0, 


where 


7i  =  r2(/i  +  7)  +  /c  >  0, 


72 


/i 


8r‘ 


+  k) 


The  roots  of  the  quadratic  are 


cr  = 


1 

2 


Clearly  one  of  the  roots  is  always  negative  or  has  negative  real  part.  The 
other  root  can  have  positive  or  negative  real  part,  depending  upon  the  value 
of  the  discriminant  y2  —  472.  We  are  interested  in  determining  if  there  are 
parameter  choices  that  lead  to  an  instability;  so  we  want  cr  positive.  Hence,  72 
must  be  negative,  or 

f-i^Sr2  +  k^j  <  fva. 

If  this  inequality  holds,  there  is  an  unstable  mode  and  perturbations  will  grow. 
We  analyze  this  further. 

The  number  r  is  the  spatial  frequency  of  the  perturbations.  Applying  the 
no-flux  boundary  conditions  forces 

r  =  n  =  0,1,  2,  3, ... 

±J 

For  each  value  of  n  we  obtain  a  frequency  r  =  rn  and  a  corresponding  growth 
factor  crn.  The  nth  mode  will  therefore  grow  and  lead  to  local  instability  when 


(n2n2 

v~ 


<  fva. 


(5.32) 


We  can  now  ask  what  factors  destabilize  the  uniform,  equilibrium  state  in 
the  amoeba-cAMP  system  and  therefore  promote  aggregation.  That  is,  when 
is  (5.32)  likely  to  hold?  We  can  list  the  factors  that  may  make  the  left  side  of 
the  inequality  (5.32)  smaller  than  the  right  side:  low  motility  /x  of  the  bacteria; 
low  degradation  rate  k  or  large  production  rate  /  of  cAMP;  large  chemotactic 
strength  v\  large  dimensions  L  of  the  medium;  a  small  value  of  n  (thus,  low 
frequency,  or  long  wave  length,  perturbations  are  less  stabilizing  that  short 
wave,  or  high  frequency,  perturbations);  decreasing  the  diffusion  constant  of 
the  cAMP.  Figure  5.5  shows  time  snapshots  of  the  amoeba  density  for  the 
mode  n  =  2  when  it  is  unstable.  The  intervals  where  the  amplitude  is  high 
correspond  to  higher  concentrations  of  amoeba,  i.e.,  regions  of  aggregation. 
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Figure  5.5  Plot  showing  the  growing  amoeba  density  at  time  t  when  the 
uniform  state  is  unstable  to  local  perturbations  in  the  mode  n  =  2 


EXERCISES 

1.  Consider  an  fish  population  that  grows  logistically,  and  at  the  same  time 
is  harvested  at  a  rate  proportional  to  the  population.  The  model  is 

du  (  u  \ 

— —  =  ru  1  —  —  —  ha 

dt  V  KJ 

where  r,  iL,  and  h  are  the  growth  rate,  carrying  capacity,  and  harvesting 
rate,  respectively.  Find  all  the  equilibria  and  analyze  their  stability.  What 
are  the  possible  long  term  fish  populations? 

2.  (Turing  system)  Consider  the  system  of  reaction  diffusion  equations  on 
the  spatial  domain  0  <  x  <  L  given  by 

ut  =  auxx  +  f(u,v),  vt  =  j3vxx  +  g(u,v) 

with  no-flux  boundary  conditions  ux  =  vx  =  0  at  x  =  0,  L.  Let  u  =  u, 
v  =  v  be  an  equilibrium  solution  and  define  small  perturbations  U  and  V 
from  equilibrium  given  by 

u  =  u  +  U(x,t),  v  =  vJrV(x,t). 

(a)  Show  that  U  and  V  satisfy  no-flux  boundary  conditions  and  the  lin¬ 
earized  perturbation  equations 


Ut  =  aUxx  +  fu(u,  v)U  +  fv(u,  v)V, 
Vt  =  /3VXX  +  gu(u,v)U  +  gv(u,v)V. 


(5.33) 

(5.34) 
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(b)  Introduce  matrix  notation 


f  U  \  D  =  (  a  0  i  J  =  (  fv(u,v)  \ 

V^/  V0/5/  V  9u(u,v)  gv(u,v)  )  ’ 


and  show  that  (5.33-5.34)  can  be  written  as 

lb  =  DV^XX  +  M. 


(5.35) 


(c)  Assume  modal  solutions  to  (5.35)  of  the  form 


cos 


TITTX 


and  show  that  for  a  nontrivial  solution  we  must  have 


det  (  anI  + 


n2  7T2 

~U~ 


D-J  =0. 


(5.36) 


When  expanded,  this  equation  is  a  quadratic  equation  for  the  growth  factor 
dn  of  the  nth  mode.  The  roots  crn  depend  upon  the  diffusion  constants  cp  /?, 
the  equilibrium  solution  n,  T,  the  size  of  the  medium  L,  and  the  wavelength 
2 L/n  of  the  perturbation.  If  one  can  find  values  of  the  parameters  that 
make  one  of  the  roots  positive  or  have  positive  real  part,  then  there  is  an 
unstable  mode. 


3.  Apply  the  method  of  Exercise  2  to  examine  the  stability  of  the  steady  state 
of  the  Turing  system 

ut  =  Duxx  +  1  -  u  +  U2V,  vt  =vxx  +  2-  u2v,  0  <  x  <ir, 


under  no-flux  boundary  conditions.  Specifically,  write  the  condition  (5.36) 
and  determine  values  of  D  for  which  various  modes  (n)  are  unstable. 

4.  Consider  Fisher’s  equation  with  Dirichlet  boundary  conditions: 


ut 


u 


u 


ry  ry> 


+  u(  1  —  ix), 


7 r  7 r 

—  <  x  <  — , 

2  2 


7 r 


=  3  at  x  =  ±  — . 

2 


(a)  Show  that  ue(x)  =  1+^osx  is  a  nonconstant  steady  state  solution. 

(b)  Define  perturbations  U(x,t)  by  the  equation  u  =  ue(x)  +  U(x,t) 
and  find  the  linearized  perturbation  equation  and  boundary  conditions  for 
U  (x,  t). 
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(c)  Assume  a  solution  to  the  linearized  equation  of  the  form  U  =  eatg(x ) 
and  show  that  g  must  satisfy 


9 


// 


+ 


cos  x  —  5 
- 9  = 

1  +  COS  X 


0  =  0  at  x  =  =L  — . 
y  2 


(5.37) 


(d)  Show  that  if  (5.37)  has  a  nontrivial  solution,  then  a  <  0,  thereby 
showing  local  stability  of  the  steady  solution.  Hint:  Consider  two  cases, 
when  g  is  positive  and  when  g  is  negative  on  the  interval  and  then  examine 
at  the  signs  of  g"  and  the  other  terms  in  (5.37)  at  a  maximum  or  minimum 
point. 


6 

Numerical  Computation  of  Solutions 


Up  to  this  point  we  have  examined  PDEs  from  an  analytical  viewpoint,  often 
seeking  a  formula  for  the  solution.  Now  we  devote  a  brief  chapter  to  solving 
PDEs  numerically.  It  is  a  fact  that  in  industry  and  applied  science  PDEs  are 
almost  always  solved  numerically  on  a  computer;  most  real-world  problems  are 
too  complicated  to  solve  analytically.  And,  even  if  a  problem  can  be  solved 
analytically,  usually  the  solution  is  in  the  form  of  a  difficult  integral  or  an 
infinite  series,  requiring  a  numerical  calculation  anyway.  This  chapter  presents 
a  brief  introduction  to  one  method,  the  finite  difference  method.  There  are 
many  other  methods,  for  example,  the  finite  element  method,  to  mention  only 
one.  Numerical  methods  for  PDEs  have  been  and  continue  to  be  one  of  the  most 
active  research  areas  in  applied  mathematics,  computer  science,  and  the  applied 
sciences  as  investigators  seek  faster  and  more  accurate  algorithms.  Another 
feature  is  that  numerical  methods  give  tremendous  insight  into  the  basic  nature 
and  theory  of  PDEs.  The  reader  will  find  a  large  amount  of  literature  on  the 
subject;  it  is  a  field  within  itself.  Brief  introductions  to  finite  difference  methods 
can  be  found  in  Smith  (1978),  Logan  (1987),  and  Holmes  (2007). 

In  the  finite  difference  method  the  idea  is  to  replace  the  continuous  PDE 
by  a  discrete  algebraic  problem  that  can  be  solved  on  a  computer  in  finitely 
many  steps.  The  result  is  a  discrete  solution  which  is  known  approximately 
at  only  finitely  many  points  of  the  domain.  Simply,  partial  derivatives  in  the 
equation  are  replaced  by  their  difference-quotient  approximations;  this  leads 
to  a  difference  equation  relating  the  discrete  approximations  of  the  solution. 
We  then  develop  a  computer  code  to  solve  for  the  discrete  approximations. 
Although  the  procedure  is  straightforward,  it  does  not  come  without  risks. 
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If  care  is  not  taken,  the  procedure  can  lead  to  disastrous  results  even  when 
it  appears  that  the  discrete  approximation  to  the  continuous  problem  is  very 
sensible. 


6.1  Finite  Difference  Approximations 


Discrete  approximations  for  derivatives  can  be  obtained  by  Taylor’s  theo¬ 
rem,  one  of  the  most  important  results  in  calculus.  It  gives  both  the  form  and 
the  accuracy  of  the  approximation.  If  we  assume  a  function  /  is  sufficiently 
differentiable,  say,  having  n  +  1  continuous  derivatives  in  a  neighborhood  of  a 
point  x,  then  Taylor’s  theorem  states  that  for  h  sufficiently  small,  f(x)  can  be 
expanded  in  as 

f(x  +h)  =  f(x)  +  f\x)h  +  \f"(x)h2  +  \f"(x)h3  H - b  f[n)  {x)hn 

2  6  n\ 

where  x  is  some  point  between  x  and  x  +  h.  The  last  term  in  this  expansion  is 
the  error  term ,  and  we  say  it  is  order  n  +  1;  we  write  it  as  0(hn+1),  a  general 
expression  meaning  that  it  is  some  constant  times  the  factor  tin+1. 

The  reader  should  be  familiar  with  this  approximation  strategy  for  ini¬ 
tial  value  problems  for  ordinary  differential  equations.  We  review  the  simplest 
method. 


Example  6.1 

(Euler’s  Method)  The  problem  is  to  numerically  solve  the  initial  value  prob¬ 
lem  (IVP) 

y'  =  t0<t<T;  y(to)  =  yo, 

where  /  and  fy  are  continuous  in  both  arguments  in  an  open  neighborhood 
containing  the  initial  point  (to,2/o)-  These  conditions  guarantee  a  unique  solu¬ 
tion  to  the  IVP  on  an  interval  containing  to.  The  first  step  in  computing  a 
numerical  approximation  is  to  discretize  the  interval  to  <  t  <  T  by  defining  a 
finite  number  of  points  tn  =  to  +  n/i,  n  =  0,  1,  . . . ,  At,  where  h  is  the  step  size 
defined  by  h  =  (T  —  to)/N.  Thus,  tn+ i  =  tn  +  h.  At  each  discrete  point  tn  of 
the  domain  we  determine  an  approximation  Yn  to  the  exact  value  y(tn )  of  the 
solution  at  that  point.  From  Taylor’s  theorem  we  know  that 

y(tn+ 1)  =  y(tn)  +  hy'(tn)  +  O (h2)  =  y(tn )  +  hf(tn,  y(tn ))  +  O (h2). 
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If  we  replace  y(tn )  by  its  approximation  Yn  and  drop  the  O (h2)  term,  the 
so-called  truncation  error,  we  get 

Yn+i  =  Yn  +  hf(tn,Yn),  n  =  0,l,...,N,  Y0  =  y0. 


This  is  the  Euler  method ,  and  we  can  easily  compute  the  approximations 
Yi,  Y2 , . . recursively  from  n  =  0  to  n  =  N  —  1.  The  Euler  method  is  a 
marching  scheme  where  the  values  are  computed  successively,  forward  in  time. 
It  is  only  one  of  many,  many  schemes  designed  for  numerical  computation  of 
IVPs.  A  MATLAB  script  named  EulerMethod  at  the  end  of  the  chapter  carries 
out  this  iteration  process  for  the  initial  value  problem 


0.1  [y-  15 


12  cos 


1/(0)  =  68 


over  the  interval  0  <  t  <  72  using  1000  time  steps.  □ 


Solving  PDEs  numerically  requires  finite  difference  approximations  for  var¬ 
ious  partial  derivatives.  If  u  =  u(x ,  t)  is  a  sufficiently  smooth  function,  then 
Taylor’s  theorem  implies 


1  '  '  -  2  ,  1  ^,3 


u(x  +  h,t)  =  u(x,  t )  +  ux( x,  t)h  +  ^uxx(x,  t)h 2  +  ^uxxx(x,  t)h 6  +  O (ft4),  (6.1) 

where  h  is  an  increment  of  x.  Therefore,  taking  only  three  terms  on  the  right, 
we  obtain  the  forward  difference  approximation 


ux(x,t) 


u(x  +  A,  t) 
h 


u(x,  t ) 


H-  O(h). 


In  a  similar  way,  if  k  is  an  increment  of  t, 


.  x  u(x,  t  +  k)  —  u(x,  t)  ^/7X 
ut(x,  t)  =  - j- - +  O (ft). 

rv 

To  obtain  an  approximation  for  uxx ,  we  can  write  (6.1)  for  h  replaced  by  —  h 
to  obtain 

1  1 

u{x  —  h,t)  =  u(x,  t)  -  ux{ x,  t)h  +  -uxx(x,  t)h2  -  —uxxx(x,  t)h 3  +  O (ft,4). 

Adding  this  expression  to  (6.1)  and  solving  for  uxx  gives  the  centered  difference 
approximation 


u 


ry>  ry* 


u(x  —  h,  t)  —  2 u(x,  t )  +  u(x  -F  h,  t) 

Jy 


+  0(h2), 


which  is  an  approximation  for  the  second  derivative.  Similarly, 

u(x,t  —  k) —  2u(x,t)  +  u(x,t  +  k)  (  ^/72 


'U'tt 


k2 


+  0  (V). 
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6.2  Explicit  Scheme  for  the  Heat  Equation 


The  idea  of  marching  forward  in  time,  as  in  Euler’s  method,  goes  over  to  evolu¬ 
tion  problems  in  PDEs.  It  this  section  we  illustrate  and  explicit  finite  difference 
method  for  an  initial  boundary  value  problem  associated  with  the  diffusion 
equation: 


ut  =  Duxx,  0  <  x  <  1,  t  >  0,  (6.2) 

?i(0,  t)  =  tx(1,  t)  =  0,  t  >  0,  (6-3) 

u(x,  0)  =  /(#),  0  <  x  <  1.  (3.4) 

The  first  step  to  discretize  the  region  of  space-time  where  we  want  to  obtain 

a  solution.  In  this  case  the  region  is  0  <  £  <  1,0  <  t  <  T.  We  put  a  bound 
on  time  because  in  practice  we  only  solve  a  problem  up  until  a  finite  time. 
Discretizing  means  defining  a  two-dimensional  lattice  of  points  in  this  space- 
time  region  by 


xj  =  j/i,  tn  =  nk , 


n 


0,1, ...,7V, 


where  the  fixed  numbers  h  and  k  are  the  spatial  and  temporal  step  sizes ,  respec¬ 
tively,  given  by  h  =  1/J  and  k  =  1/7V.  The  integer  J  is  the  number  of  subin¬ 
tervals  in  0  <  x  <  1,  and  TV  is  the  number  of  time  steps  to  be  taken.  Figure  6.1 
shows  the  lattice  of  points;  this  lattice  is  also  called  a  grid,  and  the  points  are 
called  nodes.  At  each  lattice  point  (xj,  tn)  we  seek  an  approximation,  which  we 
call  Z7J1,  to  the  exact  value  u(xj,tn )  of  the  solution.  Note  that  the  superscript  n 
on  Uj1  is  an  index  referring  to  time,  not  an  exponent;  and  the  index  subscript  j 
refers  to  space.  We  can  regard  the  grid  function  U'r-J  as  a  two-dimensional  array, 
or  matrix,  where  n  is  a  column  index  and  j  is  a  row  index.  To  obtain  equations 
for  the  approximations  Z7J1,  we  replace  the  partial  derivatives  in  the  PDE  by 
their  difference  approximations  obtained  in  Section  6.1.  So  the  continuous  PDE 
(6.2)  at  the  point  ( Xj,tn )  is  replaced  by  the  difference  equation 


Uj+1-U?  BUh_^Ul  +  UJ±l 

k  h 2 

where  we  have  dropped  the  error  terms.  Solving  for  U‘-'+  gives 

Vn+1  =  uf  +  r  ([/”_!  -  2C7J*  +  U?+1) ,  (6.5) 


where 


r  = 


kD 

ft5"’ 


Observe  that  the  difference  equation  (6.5)  relates  the  approximate  values  of  the 
solution  at  the  four  points  (xj_i,tn),  (xj,tn),  (xj-i , £n),  (xj,tn+ 1).  These  four 
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Figure  6.1  The  discrete  lattice,  or  grid,  and  the  computational  atom  of  the 
explicit  scheme  for  diffusion  equation.  The  atom  permits  calculation  of  /7™+1 
at  the  (n  +  l)st  time  level  in  terms  of  the  three  values  Ut-_  1 ,  U /7j7+ 1  at  the 
previous  time  level  n.  So  we  can  fill  up  an  entire  grid  row  by  knowing  the  grid 
values  at  the  preceding  row 


points  form  the  computational  atom  for  the  difference  scheme  (see  Figure 

6.1). 

The  difference  equation  (6.5)  gives  the  approximate  solution  at  the  node 
(xj,tn+ 1)  in  terms  of  approximations  at  three  earlier  nodes.  Now  we  see  how 
to  fill  up  the  lattice  with  approximate  values.  We  know  the  values  at  t  =  0 
from  the  initial  condition  (6.4).  That  is,  we  know 

Uj  =  f(xj),  j  =  0, 1, . . . ,  J. 

From  the  boundary  conditions  (6.3)  we  also  know 

Uq  =  0,  UJ  =  0,  n  =  1,2,..., TV. 

The  difference  formula  (6.5)  can  now  be  applied  at  all  the  interior  lattice  points, 
beginning  with  the  values  at  the  t  —  0  level,  to  compute  the  values  at  the  t  =  t\ 
level,  then  using  those  to  compute  the  values  at  the  t  =  £2  level,  and  so  on. 
Therefore,  using  the  difference  equation,  we  can  march  forward  in  time,  contin¬ 
ually  updating  the  previous  temperature  profile.  Think  of  filling  out  the  array 
[/j1,  row  by  row,  i.e. ,  time  after  time,  in  a  marching  type  scheme.  In  physical 
terms,  the  numerical  solution  carries  the  initial  temperature  distribution  for¬ 
ward  in  time;  along  each  horizontal  time  line  n,  we  can  think  of  the  j  values, 
FJ\  j  =  0,1,2,  ...,J,  as  an  approximate  time  profile. 
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A  finite  difference  scheme,  or  algorithm,  like  (6.5)  is  called  an  explicit 
scheme  because  it  permits  the  explicit  calculation  of  an  approximation  at  the 
next  time  step  in  terms  of  values  at  a  previous  time  step.  Because  of  the  error 
(called  the  truncation  error)  that  is  present  in  (6.5)  due  to  replacing  derivatives 
by  differences,  the  scheme  is  not  always  accurate.  As  it  turns  out,  if  the  time 
step  k  is  too  large,  then  highly  inaccurate  results  will  be  obtained.  It  can  be 
shown  that  we  must  have  the  stability  condition 

kD  1 
r  =  — —  <  — 

h2  ~  2 

for  the  scheme  to  converge.  This  condition  constrains  the  time  step  according 
to  k  <  0.5h2 / D.  Experiments  that  illustrate  this  behavior  are  suggested  in  the 
exercises. 

It  is  straightforward  to  code  the  explicit  algorithm  (6.5)  described  above 
to  calculate  approximate  temperatures  in  a  rod.  A  simple  MATLAB  program 
listing  entitled  ExplicitHeat  is  given  at  the  end  of  the  chapter.  The  solution 
surface  is  plotted  in  Figure  6.2. 


2  0 

Figure  6.2  Numerical  solution  to  the  problem  (6.2),  (6.3),  (6.4)  with  initial 
temperature  f(x)  =  x3(l  —  x),  diffusivity  D  =  0.02,  on  0  <  x  <  1,  0  <  t  <  2 
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von  Neumann  Stability  Analysis 

There  are  three  sources  of  errors  in  applying  finite  difference  schemes  to 
differential  equations:  (1)  truncation  errors ,  which  measure  the  errors  in  replac¬ 
ing  the  differential  equation  by  a  difference  equation;  (2)  errors  propagated  in  a 
computation  using  the  difference  scheme  itself;  (3)  roundoff  errors  accumulated 
from  the  computation  because  of  the  finite-digit  arithmetic,  or  representation 
of  real  data. 

We  have  seen  that  truncation  errors  are  proportional  to  the  step  sizes  taken 
to  approximate  derivative,  e.g.,  0(h2),  etc.  The  origin  of  roundoff  error  is  a  topic 
we  leave  to  other  sources,  for  example,  computer  science.  We  focus  here  on  (2), 
that  is,  how  does  a  difference  equation  itself  propagate  incorrect  information? 
Even  if  the  difference  approximation  has  a  very  high  accuracy  measured  by  the 
truncation  error,  we  show  that  the  difference  scheme  can  still  lead  to  horrendous 
results.  The  analysis,  which  is  one  of  several,  is  called  von  Neumann  stability 
analysis.  It  deals  with  the  issue  of  stability,  which  was  discussed  earlier  in  the 
text;  namely,  if  an  initial  error  is  small,  does  the  error  at  later  times  remain 
small  when  propagated  by  the  difference  scheme? 

Consider  the  problem 

Ut  —  uxx  =  0,  x  G  R,  t  >  0,  u(x,  0)  =  cosax,  x  G  R, 

where  a  >  0  can  regarded  as  a  wave  number,  or  the  number  of  oscillations  per 
2tt.  This  problem  can  be  solved  exactly,  and  the  solution  is 

u(x,  t)  =  e  cosax. 

Clearly  u  remains  bounded  as  t  oo.  Next,  let  us  set  up  this  problem  numer¬ 
ically  using  the  explicit  scheme.  For  the  infinite  domain  t  >  0  take  Xj  =  ±jh, 
j  =  0, 1,  2, . . .,  tn  =  nk ,  n  =  0,1,2,....  The  PDE  is  approximated  by  the 
difference  equation  (6.5), 

Vn+i  =  u™  +  r  (cy.j  -  2 ty  +  u?+1) . 

The  initial  condition  gives 

Uj  =  cosaxj,  j  =  0,  ±1,  ±2, .... 

Think  of  the  initial  condition  as  defining  a  distribution  of  errors  at  time  t  —  0. 
We  look  for  a  complex  solution  of  the  discrete  problem  in  the  form 

U™  =  MneiaXj , 
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where  the  amplitude  factor  M  is  to  be  determined.  (Note:  we  can  recover  the 
real  solution  by  taking  the  real  part  of  the  complex  solution.)  Substituting  this 
into  the  difference  equation  gives 


M  =  r  —  1  +  2r  cos  ah 

n  ,  •  2 

=  1  —  4r  sin  — . 

2 


Therefore,  the  real  solution  is 


4r  sin' 


n 

cos  axj . 


It  is  clear  that  U  will  be  bounded  whenever 


.  ,  .  2  ah 

1  —  4r  sm  —  <1  or  r  < 


1 


2  sin 


2  ah 


If  r  <  \  this  is  guaranteed.  If  on  the  other  hand  r  >  then  there  is  always  a 
value  of  a  for  which  |  M  |>  1.  It  follows  that  the  explicit  scheme  with  r  >  |  is 
not  stable  because  bounded  initial  conditions  are  propagated  by  the  scheme  so 
that  they  become  unbounded.  For  r  <  |  we  say  the  scheme  is  von  Neumann 
stable. 


Example  6.2 

The  following  diagram  shows  the  result  of  applying  the  explicit  scheme  (6.5) 
in  the  simple  case  r  =  1,  which  is  an  unstable  case.  In  this  case,  the  difference 
equation  becomes 

up+1  =  -up  +  c/”_1  +  up+1. 

At  time  to  =  0  we  assume  there  is  an  initial  error  e  at  ro  =  0.  Using  the 
scheme,  the  values  of  C/j1,  j  =  0,  ±1,  ±2,  ±3, . . .  were  computed  (by  hand)  for 
subsequent  time  lines  U,  £2,  etc*  One  can  observe  that  the  error  propagates, 
spreading  and  growing  in  time.  It  would  make  a  numerical  approximation  com¬ 
pletely  invalid.  □ 
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Example  6.3 

(Explicit  Scheme  for  the  Wave  Equation)  This  same  type  of  explicit 
marching  procedure  that  worked  for  the  heat  equation  works  for  initial  value 
problems  associated  with  the  wave  equation.  Consider  the  problem 

uu  =  c2uxx,  0  <  x  <  1,  t  >  0,  (6.6) 

ti(0,  t)  =  u(l,  t)  =  0,  t>  0,  (6-7) 

u(x,0)  =  f(x),  Ut(x,  0)  =  g(x),  0  <  x  <  1.  (6.8) 

First  we  discretize  the  space-time  domain  as  before  and  form  a  lattice  Xj  = 

jh,  tn  =  n/c,  j  =  0, 1, . . . ,  J,  n  =  0, 1, . . . ,  N.  We  can  approximate  the  PDE 
by 

yn_1  -  2cy  +  yn+1  _  2  q*_i  -  2ty  +  ty+1 
P  -c  P  ’ 

which  leads  to 


2(1  -  s2)t/”  +  s2{t/”_,  +  t/"+1) 


/yn_1 

uj  ’ 


where 


s  = 


ck 

~h' 


The  computational  atom  for  this  explicit  scheme  is  shown  in  Figure  6.3.  To 
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Figure  6.3  Computational  atom  for  the  wave  equation.  The  values  at  the 
(n  +  l)st  time  level  depend  on  the  two  previous  times,  n  and  n  —  1 


compute  the  value  at  the  (n  +  l)st  time  step  we  now  need  values  from  the 
previous  two  time  steps.  Therefore,  to  start  the  marching  scheme  we  require 
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values  from  the  first  two  (to  =  0  and  t\  =  k)  time  rows.  The  t  =  0  row  is  given 
by  the  initial  condition  u(x,  0)  =  /(#);  so  we  have 

Uj  =  f{xj),  j  =  0, 1, . . . ,  J. 

The  t  =  t\  time  row  can  be  computed  using  the  initial  velocity  condition 
Ut(x,  0)  =  g(x),  which  we  approximate  by  a  forward  difference  formula, 

U1  -  U° 

1  k  1  =  3  =  1,2, J-  1. 

Thus,  the  values  Uj  at  the  second  row  are 

Uj  =  Uj  +  kg{xj). 

Now  we  have  the  ingredients  to  start  the  scheme  and  march  forward  in  time 
to  fill  up  the  entire  grid  with  values,  marching  forward  from  time  row  to  time 
row.  Again  there  is  a  stability  condition,  namely,  s  <  1,  or 


This  is  the  CFL  condition,  or  the  Courant-Friedrichs-Lax  condition.  Physi¬ 
cally,  it  states  that  the  speed  h/k  at  which  lattice  points  are  being  calculated 
must  exceed  the  speed  c  that  waves  are  propagated  in  the  system.  Thus  the 
calculated  values  at  the  lattice  points  at  a  given  time  level  will  contain  all  of 
the  information  about  the  wave  at  the  previous  time  level.  We  ask  the  reader 
to  write  a  code  for  the  explicit  scheme  to  solve  the  wave  equation.  □ 

Example  6.4 

(Other  boundary  conditions)  In  the  explicit  marching  scheme  for  the  heat 
equation  we  considered  fixed,  Dirichlet  boundary  conditions.  For  example,  this 
forces  the  values  Uq  along  the  grid  points  (0,  n)  to  be  fixed  along  the  left 
boundary.  If,  however,  the  left  boundary  has  a  Neuman  condition,  then  we 
do  not  know  Uq  a  priori ,  and  the  values  must  be  calculated.  For  illustration, 
assume  a  boundary  condition  of  the  form 

ux(0,t)  =  b(t),  t  >  0. 

Using  the  spatial  grid  Xj  =  jh ,  j  =  0,1,2,...,  as  before,  we  discretize  this 
condition  by 

h 

for  each  n.  In  terms  of  the  grid  function  Z7,  this  is 
Uf  -  U0n 


h 


b(tn),  or  Uq  =  Ui  —  hb(tn ),  n—  1,2,3, - 
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Therefore,  given  the  values  along  the  nth  row,  the  calculation  of  the  values 
along  the  (n  +  l)st  row  is 

for  j  =  1  to  J  -  1;  t/”+1  =  Uf  +  r(Ujl_1  -  2U?  +  £/"+1);  end 

K+1  =  url  ~  hb(tny, 

Similar  conditions  can  be  derived  for  radiation  boundary  conditions.  The  wave 
equation  can  be  handled  in  a  similar  way.  □ 


EXERCISES 

1.  Use  Euler’s  method  with  step  size  h  =  0.1  to  numerically  solve  the  initial 

value  problem  y'  =  —2 ty  -f-  t/2,  7/(0)  =  1  on  the  interval  0  <  t  <  2. 

Compare  your  approximations  with  the  exact  solution. 

2.  Use  the  notation  in  this  section,  derive  the  centered  difference  approxima¬ 
tion  to  the  first  derivative, 

t|,M  =  "(x  +  ft)-.(x-ft)+o(ft2) 

3.  Develop  an  explicit  scheme  to  solve  the  heat  conduction  problem 

Uf  —  U xx ?  0  X  <C  1,  t  0, 

iz(0,t)  =  l,  ux(l,t)  =  -(u(l,t)  -  g(t)),  t  >  0, 
u(x ,  0)  =  0,  0  <  x  <  1. 

Hint:  Approximate  the  radiation  condition  ux(l,  t)  =  —  —  g(t ))  by 

Pick  ^f(t)  =  1  and  compute  an  approximate  solution  surface.  Finally, 
describe  the  physical  context  of  this  problem. 

4.  Write  a  program  in  some  programming  language  or  in  a  computer  algebra 
package  to  solve  the  hyperbolic  problem  (6.6)-(6.8)  when  f(x)  =  0  and 
g(x)  =  x(l  —  x)  with  c  =  0.25.  Take  h  =  0.125  and  experiment  with 
various  time  step  sizes  k  and  illustrate  the  validity  of  the  CFL  condition. 
Compare  with  the  exact  solution. 

5.  Consider  the  explicit  method  for  the  wave  equation  wave  equation  in  x  E 
M,  t  >  0  with  CFL  parameter  s  =  2.  Take  the  initial  condition  f(pc)  at 
the  discrete  grid  points  Xj  to  be  . . . ,  0,  0,  0, 1,  2, 1,  0,  0,  0, . . .  and  assume 
Ut(x,  0)  =  0  for  all  x.  By  hand,  compute  the  values  of  U J  for  the  four  time 
levels  n  =  1,2,  3, 4.  Next  make  the  same  calculation  with  5  =  1.  Discuss 
your  observations. 
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6.  Consider  the  Cauchy  problem  for  the  advection  equation,  ut  +  cux  =  0, 
where  c  >  0. 


a)  Expand  u(x,t  +  k)  in  a  Taylor  series  up  to  O (k3)  terms.  Then  use  the 
advection  equation  to  obtain 

c2k2 

u(x,  t  +  k)  =  u(x,  t)  —  cku(X ,  t)  H - uxx(x,  t )  +  0(/c3). 


b)  Replace  ux  and  uxx  by  centered  difference  approximations  to  obtain 
the  explicit  scheme 

1  1  pic 

U?+1  =  (1  -  S2)U J  +  -(1  +  s)U^  -  -(1  -  s)U?+1,  S  = 

This  is  the  Lax— Wendroff  method.  It  is  von  Neumann  stable  for 
0  <  s  <  1  and  it  is  widely  used  to  solve  both  linear  and  nonlinear 
first-order  hyperbolic  systems. 

c)  Use  the  Lax- Wendroff  method  to  numerically  solve  ut  +  3^  =  0  in  the 
upper  half-plane  where  u(x,  0)  =  2x(2  —  x)  if  0  <  x  <  2  and  u(x,  0)  =  0 
otherwise.  Take  h  =  0.2,  k  =  0.05.  Compare  your  answer  to  the  exact 
solution. 


6.3  Laplace’s  Equation 

Next  we  solve  a  Dirichlet  problem  for  Laplace’s  equation  in  the  unit  square  with 
prescribed  values  on  the  boundary  of  the  square.  This  Dirichlet  problem  is  a 
pure  boundary  value  problem,  and  we  should  not  expect  an  explicit  marching- 
type  scheme  to  work;  there  is  no  guarantee  that  when  we  march  from  one 
boundary  in  the  problem  to  another  we  will  reach  the  already  specified  values 
at  that  boundary.  This  means  that  we  must  solve  for  the  approximate  values 
at  all  of  the  interior  lattice  points  simultaneously.  To  illustrate  the  procedure 
we  consider  the  Dirichlet  problem 

Uxx  +  Uyy=0,  0  <  x  <  1,0  <  y  <  1,  (6.9) 

u(0,y)  =  u(l,y)  =  0,  0  <  y  <  1,  (6.10) 

u(x,  0)  =  f(x),  u(x,  1)  =  0,  0  <  x  <  1.  (6-11) 

We  discretize  the  unit  square  by  defining  lattice  points 

Xi  =  ih,  yj  =  jk,  i  =  0,1,...,/;  j  =  0, 1,...,J, 
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where  ft  =  1/7  and  k  =  1/J  are  the  fixed  step  sizes  in  the  x  and  y  directions, 
respectively.  We  let  Uij,  with  two  lower  subscripts,  denote  the  approximate 
value  of  u(xi,yj).  Then  Laplace’s  equation  can  be  approximated  at  the  lattice 
point  ( Xi,yj )  by  the  difference  equation 

~  2Uj,j  +  Uj+ij_  Uj,j-i  —  2Uj,j  +  Ujj+i  _  p 

ft2  k2 

The  reader  may  wish  to  review  Section  1.8,  where  a  similar  approximation  is 
made  when  ft  =  k.  The  difference  equation  can  be  rewritten  as 

V,j  =  2fc2  +  2 ft.2  ^■Ui~1’j  +  Ui+1^  +  2fc2  +  2 h?  ^ Ui,j~ 1  +  Ui’j+1^' 

The  computational  atom  is  shown  in  Figure  6.4.  The  difference  equation  relates 
the  node  at  the  center  of  the  atom  to  four  adjacent  nodes.  As  we  observed  in 
Section  1.8,  if  h  =  fc,  then  this  difference  equation  states  that  the  value  at  the 
center  node  is  approximately  the  average  of  the  values  at  the  four  adjacent 
nodes;  if  ft  ^  fc,  the  approximate  value  at  the  center  is  a  weighted  average  of 
the  four  surrounding  values. 


Figure  6.4  Computational  atom  for  Laplace’s  equation 


The  strategy  is  this.  The  values  of  Uij  are  known  at  the  nodes  on  the 
boundary  of  the  square.  We  want  to  find  the  values  at  the  (7  —  1)  x  (j-i) 
interior  lattice  points.  Therefore,  we  apply  the  difference  equation  (6.12)  to 
each  interior  lattice  point,  i.e.,  for  i  =  1, . . . ,  I  —  1;  j  =  1, . . . ,  J  —  1.  The 
result  is  a  linear- algebraic  system  consisting  of  (7  —  1)  x  (J  —  1)  equations  in 
the  (7  —  1)  x  (J  —  1)  unknowns  Uij.  Therefore,  we  have  reduced  the  Dirichlet 
problem  to  the  linear  algebra  problem  of  solving  for  grid  function  Uij  at  the 
interior  nodes. 
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Figure  6.5  Discrete  lattice 


Example  6.5 

In  (6.11)  take  f(x)  =  x3(l  —  x).  To  illustrate  the  procedure  take  7  =  4  and 
J  =  3;  thus  j  =  1/4  and  k  =  1/3.  See  Figure  6.5.  Then 

2fc2  +  2  ft,2  =  °'18,  2fc2  +  2  h?  =  °'32' 

There  are  six  interior  lattice  points  where  the  solution  is  to  be  found.  By 
the  boundary  conditions,  the  values  along  the  top  and  two  sides  are  zero, 
and  the  values  along  the  lower  boundary  are  Uo,o  =  0,  T/yo  =  0.0117,  7/2,0  = 
0.0625,  7/3,0  =  0.1054,/745o  =  0.  Now  we  apply  the  difference  equation  (6.12) 
successively  at  the  six  points  (i,j)  =  (1, 1),  (1,  2),  (2, 1),  (2,  2),  (3, 1),  (3,  2)  to 
obtain  the  six  coupled  algebraic  equations 

7/i,i  =  0.32(0  +  7/2,1)  +  0.18(0.0117  +  7/1,2), 

7/i,2  =  0.32(0  +  7/2,2)  +  0.18(7/i,i  +  0), 

7/2,1  =  0.32(7/i,i  +  7/3,1)  +  0.18(0.0625  +  7/2,2), 

7/2,2  =  0.32(7/1,2  +  7/3,2)  +  0.18(7/2,1  +  0), 

7/3,1  =  0.32(7/2,1  +  0)  +  0.18(0.1054  +  7/3,2), 

7/3,2  =  0.32(7/2,2  +  0)  +  0.18(7/3,1  +  0). 

We  can  solve  these  six  linear  equations  to  obtain  the  approximate  solution  at 
the  six  interior  nodal  points: 

7/i,i  =  0.007,  7/1,2  =  0.003,  7/2,1  =  0.015, 

7/2,2  =  0.006,  7/3,1  =  0.024,  7/3,2  =  0.006.  □ 
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In  the  previous  example  we  solved  the  linear  system  easily  on  a  calculator. 
But  if  the  lattice  has  a  large  number  of  nodes,  which  is  the  usual  case,  then 
direct  solution  is  inefficient,  even  though  the  coefficient  matrix  for  the  system 
is  tridiagonal  (nonzero  only  on  the  sub-,  super-  and  main  diagonals).  Another 
way  to  solve  a  linear  system  is  by  an  iterative  process  known  as  the  Gauss— 
Seidel  method.  In  this  method  we  fix  the  values  of  the  array  Uij  at  the  known 
boundary  points  and  then  initialize  the  array  at  the  interior  lattice  points  by 
setting  them  to  zero,  or  setting  them  to  values  that  might  approximate  the 
solution.  Then  we  systematically  cycle  through  all  the  interior  lattice  points, 
replacing  the  old  estimates  with  new  ones  calculated  by  the  difference  equation 
(6.12).  We  can  cycle  along  rows  or  columns,  but  it  is  common  to  cycle  row  by 
row,  from  bottom  to  top.  After  a  number  of  cycles,  the  estimates  will  converge 
to  the  solution  of  the  linear  system.  One  can  terminate  the  iteration  process 
when  the  change  from  one  complete  iteration  to  the  next  is  small,  measured  in 
some  manner.  This  process  is  extremely  simple  to  program,  and  the  cycling  is 
accomplished  in  a  few  lines  of  code.  Actually,  there  are  several  ways  the  iteration 
procedure  can  be  accelerated,  and  we  refer  the  reader  to  the  references  and  the 
key  words  successive  overrelaxation  (SOR). 

0.25  > 


0.2- 


0  0 


Figure  6.6  Solution  to  the  BVP  (6.9),  (6.10),  (6.11)  for  Laplace’s  equation 
when  f(pc)  =  x(l  —  x) 


Here  is  a  basic  outline  of  the  Gauss-Seidel  method.  After  putting  in  the  pre¬ 
scribed  values  on  the  boundary  and  initializing  the  the  array  Uij  (to  zero,  say) 
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at  the  interior  grid  points,  we  perform  the  following  row-by-row  iteration  using 
three  nested  loops: 

for  n  from  1  to  number  of  iterations; 
for  i  from  1  to  I- 1 ; 
for  j  from  1  to  J-l; 

U[iJ\  =kA2/(2*kA2+2*hA2)*(U[i-l,j]-hU[i+l,j]) 
+hA2/(2*kA2+2*hA2)*(U[i,j-l]+U[i,j-bl]); 
end;  end;  end; 

The  MATLAB  code  Laplace  is  at  the  end  of  the  chapter.  We  take  I  —  20 
and  J  =  10,  giving  19  x  9  =  171  interior  lattice  points.  Figure  6.6  shows  the 
solution  surface  after  20  iterations  through  the  grid. 

Neumann  and  radiation  boundary  conditions  can  be  handled  similarly  as 
in  the  heat  and  wave  equations.  See  the  Exercises. 


EXERCISES 

1.  In  Example  6.5  write  the  linear  system  for  the  six  unknowns  Uij  in  stan¬ 
dard  matrix  form,  Ax  =  b,  where  x  is  the  column  vector  of  unknowns,  and 
A  is  the  coefficient  matrix.  Note  that  A  is  a  tridiagonal  matrix. 

2.  Write  a  finite  difference  code  to  solve  the  following  steady-state  problems: 

a) 


u 


xx  +  ^ yy  —  0,  0<CrC2,  Oc^/Cl, 

u(0,y)  =  2sin7ry,  u{2,y)=0,  0  <  y  <  1, 
u(x,  0)  =  x(2  —  x),  u(x,  1)  =  0,  0  <  x  <  2. 


^ xx  T  ^ yy  —  0,  0  x  2,  0  <C  2/  <C  1 , 

M°>  y)  =  0,  u(  2,  y)  =  0,  0<y<l, 
u(x,  0)  =  x(2  —  x)2 ,  u(x,  1)  =  0,  0  <  x  <  2. 

To  handle  the  insulated  boundary  condition  at  x  =  0  in  (b),  use  a  forward 
difference  approximation;  note  that  you  must  calculate  the  solution  along 
that  boundary. 

3.  Consider  Laplace’s  equation  on  the  triangle  with  vertices  (0,  0),  (1,  0),  (0, 1). 
Use  a  lattice  with  spacing  h  =  k  =  If  the  oblique  edge  is  insulated,  find 
a  difference  approximation  at  a  point  (xj,  y^)  on  that  boundary.  If  the  left 
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edge  is  held  at  1  and  the  lower  edge  is  held  at  0,  find  the  matrix  equation 
for  the  unknown  grid  points.  (Recall,  the  oblique  boundary  points  must  be 
calculated.) 


6.4  Implicit  Scheme  for  the  Heat  Equation 


In  an  explicit  difference  scheme  we  compute  a  new  time  values  of  the  grid 
function  UJ+1  explicitly  only  in  terms  of  the  previous  time  values  Uf1 .  As  we 
found  in  the  explicit  method  for  the  heat  equation,  we  were  forced  to  take 
very  small  time  to  insure  stability.  There  are  other  difference  schemes  that 
are  implicit  in  nature.  In  these  we  are  not  able  to  solve  for  a  new  time  value 

explicitly,  but  rather  we  obtain  a  system  of  algebraic  equations  at  the  new 

time  for  all  the  values  simultaneously.  These  methods  typically  have  a  strong 
advantage:  they  are  stable  without  putting  a  constraint  on  the  size  of  time  step. 
A  disadvantage  is  that  coding  such  schemes  is  a  little  more  difficult. 

In  this  section  we  set  up  such  a  scheme  for  the  heat  equation.  Consider  the 
initial  BVP  problem  for  the  heat,  or  diffusion,  equation  presented  in  Section  6.2: 

ut  =  Duxx ,  0  <  x  <  1,  0  <  t  <  T,  (6.13) 

i^(0,  t)  =  u(l,  t)  =  0,  t  >  0,  (6.14) 

u(x,  0)  =  /(#),  0  <  x  <  1.  (6.15) 

The  first  step  to  discretize,  as  before,  the  region  of  space-time  where  we  want 
to  obtain  a  solution.  In  this  case  the  region  is0<x<l,0<£<T.  Therefore 
we  define  a  lattice  of  points  in  this  space-time  region  by 


Xj  =  jh ,  tn  =  nk , 


n  =  0,1, 


where  the  fixed  numbers  h  and  k  are  the  spatial  and  temporal  step  sizes  given 
by  h  =  1  /  J  and  k  =  1/N.  For  an  implicit  scheme  we  approximate  the  time 
derivative  as  before  by  a  forward  difference 

/  ,  \  u(xj ,  tn-\- 1  —  u(Xj  An)  \ 

'U't  \%j  ?  tn)  —  ^  T  0(/c) . 

But  for  the  second  spatial  derivative  uxx  at  (xj,tn),  rather  than  a  centered 
difference  at  time  tni  we  take  a  weighted  average  of  two  centered  difference 
approximations,  one  at  tn  and  one  at  tn+\.  Therefore, 


^n(b)^)  —  (1  ^) 


u{xj—\ ,  tn_j_ i )  2u(xj ,  tn- |_i )  T  u{xj- |_i ,  tn- |_i ) 

h2 


_l_  QU{xj-i’>tn)  2 u(xj,tn)  +  u(xj+i ,tn)  ^  0(h2) 
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where  0  <  6  <  1.  Therefore,  the  computational  molecule  is  shown  in  Figure 
6.7.  Substituting  these  approximations  into  the  PDE  (6.13)  using  U J  as  the 


Figure  6.7 

scheme 


tn  +  1^" 


n 


L 


T 


i 


xj- 1 


xj 


V  i 


A  six-node  computational  molecule  for  the  implicit  difference 


approximation  for  u(xj,tn ),  we  obtain  the  finite  difference  equation 


Tjn- 1-1  T jn 

3 _ _3_ 

k 


DO 


Up- 11  -  2f/”+1  +  U?P 


3 


3  + 1 


h2 


+  £>(1-0) 


TTn 
U3~  1 


h2 


We  have  dropped  the  truncation  error  terms.  We  rearrange  the  terms  in  this 
equation  to  obtain 


-OrUppil  +  2 rO)Upl  -  OrUp l1 

=  (1  -  eyup  +  [1  -  2(1  -  0)r]p  +  (1  -  0)rup,  (6.16) 


where 


r  = 


kD 

~h?' 


Notice  that  when  0  =  0  this  scheme  reduces  to  the  classical  explicit  scheme; 
when  0  =  the  scheme  is  called  the  Crank— Nicolson  scheme.  We  have 
written  the  three  unknown  values  7/j^1 ,  at  the  (n  +  l)st  time 

level  in  terms  of  the  three  known  values  at  the  nth  time  level.  For  each  fixed  n, 
we  have  to  solve  this  equation  for  j  =  1,  2, . . . ,  J  —  1.  Therefore  equation  (6.16) 
represents  a  system  of  J—  1  equations  for  the  J— 1  values  Z7^+1,  Z7J+1, . . . ,  Uj+- x . 
Each  equation  has  three  unknowns.  In  matrix  form  (6.16)  can  be  written  as  a 
tridiagonal  system 


/  1  +  2  rO  —rO  \ 

(  up 1  \ 

/  Ff  \ 

—rO  1  +  2  rO  —rO 

Up 1 

Ff 

—rO  1  +  2  rO  —rO 

^3n+1 

— 

F3" 

—rO 

^j-2 

y  —  rO  1  -j-  2 rO  J 

upi  ) 

where  the  right  side  involves  the  terms  on  the  right  of  (6.16)  and  the  bound¬ 
ary  conditions.  This  system  must  be  solved  at  each  time  step,  but  there  are 
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no  constraints  on  the  time  step  as  in  explicit  methods.  This  scheme  is  stable 
unconditionally.  As  it  turns  out,  there  are  simple  direct  algorithms  to  solve 
tridiagonal  systems  (see  the  references,  MATLAB  manuals,  or  other  computer 
algebra  systems).  In  addition,  there  are  iterative  methods  similar  to  the  Gauss- 
Seidel  method  that  can  be  applied  at  each  time  step  to  solve  the  system. 


EXERCISES 

1.  Use  the  Crank-Nicolson  scheme  with  6  =  0.5,  D  =  1,  h  =  0.2,  k  =  0.08 
and  write  out  the  scheme  (6.16)  for  n  =  0  when  the  initial  condition  (6.15) 
is  given  by  f(pc)  =  x(l  —  x).  Write  the  tridiagonal  system  explicitly  (there 
are  four  unknowns)  and  solve  it  to  find  the  solution  at  the  time  line  n  =  1. 

2.  Use  the  Crank-Nicolson  scheme  to  numerically  solve  the  problem 

ut  =  uxx  +  u,  0  <  x  <  1,  t  >  0, 
u(0,t)  =  u(l,t)  =  0,  t  >  0, 
u(x,  0)  =  1,  0  <  x  <  1. 


Take  h  =  k  =  0.2. 

3.  Consider  the  BVP  for  the  advection  equation 

ut  +  cux  =  0,  x  G  R,  (c  >  0), 
with  u(x,  0)  =  /(#),  x  >  0,  u(0,t)  =  g(t),  t  >  0. 

a)  Using  the  three-point  molecule  (xj,tn),  (ay_i,tn+i),  (oq-,tn+i),  derive 
the  implicit  difference  equation  approximation 


(i  +  s)yn+1  -  st/jy1 


s  = 


ch 

¥‘ 


b)  Show  that  the  scheme  is  von  Neumann  stable. 

c)  Develop  an  algorithm  to  compute  U J  at  lattice  points  in  the  first  quad¬ 
rant.  Explain  carefully  your  procedure. 


MATLAB  Program  Listings 

Programming  note :  MATLAB,  like  other  computer  algebra  systems,  does 
not  recognize  an  index  with  value  0.  Many  of  the  formulas  in  the  text  require,  for 
example,  ranges  such  as  j  =  0, 1,  2, . . . ,  J.  For  MATLAB  expressions  we  must 
shift  the  indices  and  use  j  =  1,2,...,J  +  1.  This  is  an  unfortunate  nuisance  in 
writing  programs. 
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function  EulerMethod 

f=@(t , Y)  -0.1* (Y- (15+ 12* cos (3 . 14*t/ 12) ) ) 

t0=0 ;  Y0=68 ;  T=72;  N=1000;  h=(T-tO)/N;  Y=zeros (N+l) ;  Y(1)=Y0; 
for  n=l:N 
t(n)  =  a+(n-l)*h; 

Y (n+l)  =  Y (n) +h*f (t (n) , Y (n) ) ; 
end 

t=tO:h:T;  plot(t,Y),  xlabel( ’time  in  hours’), 
ylabel( ’temperature  Y  in  deg  F’), 
ylim([0  70]),  title ( ’Newton  Law  of  Cooling’) 


function  ExplicitHeat 

xmax=l;  J=20;  h=xmax/J;  tmax=2;  N=50;  k=tmax/N; 

D=0 . 02 ;  r=D*k/(h*h) 

U=zeros ( J+l ,N+1) ; 
f=@(x)  x.A3.  x* (1-x) ; 
for  j  =  l:J+l;  U( j , l)=f ( ( j-1) *h) ;  end 
for  n=l : N+l ;  U(l,n)=0;  U(J+l,n)=0;  end 
for  n=l:N 
for  j=2:J 

U(j,n+l)=U(j  ,n)+r*(U(j-l  ,n)-2*U(j  ,n)+U(  j  +  l  ,n) )  ; 
end 
end 

[T,X] =meshgrid(0 :h:xmax,0 :k: tmax) ; 
mesh(X’ ,T’ ,U, ’EdgeColor’ , ’black’) 

xlabel ( ’ t ’ , ’ FontSize ’ , 14) , ylabel ( ’ x ’ , ’ FontSize ’ , 14) 
title ( ’Explicit  Scheme  for  Heat  Equation’) 
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function  Laplace 

°/0  Gauss — Seidel  method 

xmax= 1 ;  ymax= 1 ; 

1=20;  J=10 ;  h=l/I;  k=l/J;  Numit=20; 

r=0 . 5*kA2/  (kA2+hA2)  ;  s=0 . 5*hA2/  (kA2+hA2)  ; 

U=zeros (1+1 , J+l) ; 
f=@(x)  x.  *  (1-x); 

for  i=l : 1+1 ;  U(i , l)=f ( (i-1) *h) ;  end 
for  i=l:I+l;  U(i,J+l)=0;  end 
for  j=2:J;  U(l,j)=0;  end 
for  j  =2 : J ;  U(I+l,j)=0;  end 
for  n=l:Numit 
for  i=2:I 
for  j  =2 : J 

U(i, j)=r*(U(i-l , j)+U(i+l , j) )+s* (U(i , j-l)+U(i , j+1) ) ; 
end 
end 
end 

[X , Y] =meshgrid (0 : h : xmax , 0 : k : ymax) ; 
mesh(X5 ,Y5 ,U, ’EdgeColor5 , 5blackJ) 

xlabel ( 5  x 5 , 5  FontSize 5 , 14) , ylabel ( 5  y 5 , ; FontSize  5 , 14) 


A 

Differential  Equations 


PDEs  are  frequently  solved  by  reducing  them  to  one  or  more  ODEs.  This 
appendix  contains  a  brief  review  of  how  to  solve  some  of  the  basic  ODEs  encoun¬ 
tered  in  this  book. 

For  notation,  we  let  y  =  y(x)  be  the  unknown  function.  Derivatives  will 
be  denoted  by  primes,  i.e.,  y '  =  y\x),  y "  =  yr\x).  Sometimes  we  use  the 
differential  notation  yr  =  If  /  is  a  function,  an  antiderivative  is  defined 
as  a  function  F  whose  derivative  is  /,  i.e.,  Ff(x)  =  f(x).  Antiderivatives  are 
unique  only  up  to  an  additive  constant,  and  they  are  often  denoted  by  the  usual 
indefinite  integral  sign: 

F(x)  =  [  f(x)dx  +  C. 


An  arbitrary  constant  of  integration  C  is  added  to  the  right  side.  However,  in 
this  last  expression,  it  is  often  impossible  to  evaluate  the  antiderivative  F  at  a 
particular  value  of  x  because  the  indefinite  integral  cannot  be  found  in  terms 
of  familiar  functions.  ^Take,  for  example,  f(x)  =  e~x'2  In  this  case  we  must 

use  the  general  form  of  the  antiderivative  as  a  definite  integral  with  a  variable 
upper  limit, 

rX 

F(x)  =  /  f(s)ds  +  C, 

J  a 

where  a  is  any  constant  (observe  that  a  and  C  are  not  independent,  since 
changing  one  changes  the  other).  If  /  is  continuous  at  x,  by  the  fundamental 
theorem  of  calculus,  F'  (x)  =  f(x). 


(c)  Springer  International  Publishing  Switzerland  2015 

J.  D.  Logan,  Applied  Partial  Differential  Equations,  3rd  edition. 

Undergraduate  Texts  in  Mathematics,  DOI  10.1007/978-3-319-12493-3 
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First-Order  Equations 


An  ODE  of  the  first  order  is  an  equation  of  the  form 


G[x,y,y'j  =  0  or  y'  =  F(x,y) 


There  are  three  types  of  equations  that  occur  regularly  in  PDEs:  separable, 
linear,  and  Bernoulli.  The  general  solution  involves  an  arbitrary  constant  C 
that  can  be  determined  by  an  initial  condition  of  the  form  y(x o)  =  yo . 

Separable  Equations.  A  first-order  equation  is  separable  if  it  can  be  written 
in  the  form 

S = m3iy)- 

In  this  case  we  separate  variables  and  write 

dy 


g(y ) 


f(x)dx. 


Integrating  both  sides  gives 


dy 

g(y) 


=  J  f(x)dx  +  C, 


which  defines  the  solution  implicitly.  As  noted  above,  sometimes  the  antideriva¬ 
tives  must  be  written  as  definite  integrals  with  a  variable  upper  limit  of  inte¬ 
gration. 

Linear  Equations.  A  first-order  linear  equation  is  one  of  the  form 

y1  +p(x)y  =  q(x). 

This  can  be  solved  by  multiplying  both  sides  by  an  integrating  factor  of  the 
form 

P(x)  =  e/X8)*. 

This  transforms  the  left  side  of  the  equation  into  a  total  derivative,  and  it 
becomes 


d_ 

dx 


( P(x)y )  =  P{x)q{x), 


Now,  both  sides  can  be  integrated  from  a  to  x  to  find  y. 

For  example,  to  find  an  expression  for  the  solution  to  the  initial  value  prob¬ 
lem 

y'  +  2  xy  =  y/x,  y(0)  =  3, 

We  note  that  the  integrating  factor  is 


P{x)  =  exp  (/  2 sds 
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Multiplying  both  sides  of  the  equation  by  the  integrating  factor  gives 


Now,  integrating  from  0  to  x  (while  changing  the  dummy  variable  of  integration 
to  s)  gives 


Solving  for  y  gives 


ds. 


Bernoulli  Equations.  Bernoulli  equations  are  nonlinear  equations  having  the 
form 

y'  +p(x)y  =  q{x)yn . 

The  transformation  of  dependent  variables  w  =  y1_"  transforms  a  Bernoulli 
equation  into  a  first-order  linear  equation  for  w. 

Second-Order  Equations 


Constant-Coefficient  Equations.  The  linear  equation 


ay"  +  by'  +  cy  =  0, 

where  a,  6,  and  c  are  constants,  occurs  frequently  in  applications.  We  recall 
that  the  general  solution  of  a  linear,  second-order,  homogeneous  equation  is  a 
linear  combination  of  two  independent  solutions.  That  is,  if  yi(x)  and  y2{x) 
are  independent  solutions,  then  the  general  solution  is 

y  =  ciyi(x)  +  c2y2(x), 

where  c\  and  C2  are  arbitrary  constants.  If  we  try  a  solution  of  the  form  y  =  e17100 , 
where  m  is  to  be  determined,  then  substitution  into  the  equation  gives  the  so- 
called  characteristic  equation 

9 

am  +  brn  +  c  =  0 

for  m.  This  is  a  quadratic  polynomial  that  will  have  two  roots,  m\  and  m2, 
called  eigenvalues.  Three  possibilities  can  occur:  unequal  real  roots,  equal  real 
roots,  and  complex  roots  (which  must  be  complex  conjugates). 

Case  (I).  mi,  m2  real  and  unequal.  In  this  case  two  independent  solutions  are 

2/i  =  emiX  y2  =  em2X. 
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Case  (II).  mi,  m2  real  and  equal,  i.e. ,  mi  =  m2  =  m.  In  this  case  two  indepen¬ 
dent  solutions  are 


V 1 


rax 

c 


y  2  = 


mcc 


Case  (HI),  mi  =  a  +  i/3,  m2  =  a  — i [3  are  complex  conjugate  roots.  In  this  case 
two  real,  independent  solutions  are 


yi  =  eax  sin  /3x  1/2  =  cos  fix. 

Of  particular  importance  are  the  two  equations 

y"  +  a2y  =  0,  y"  -  a2y  =  0 


which  have  general  solutions 

y(x)  =  ci  cos  ax  +  C2  sin  ax,  y  =  c\e~ax  +  C2eax, 
respectively.  Equivalently,  this  second  solution  can  be  written 


y  =  ci  cosh  ax  +  C2  sinh  ax. 

These  equations  occur  so  frequently  that  it  is  best  to  memorize  their  solutions. 

Cauchy— Euler  Equations.  It  is  difficult  to  solve  second-order  linear  equa¬ 
tions  with  variable  coefficients.  The  reader  may  recall  power  series  methods  are 
generally  applied.  However,  there  is  a  special  equation  that  can  be  solved  with 
a  simple  formula,  namely,  a  Cauchy-Euler  equation  of  the  form 

ax2y"  +  bxy  +  cy  —  0. 

This  equation  admits  power  functions  as  solutions.  Hence,  if  we  try  a  solu¬ 
tion  of  the  form  y  =  xm,  where  m  is  to  be  determined,  then  we  obtain  upon 
substitution  the  characteristic  equation 

am(m  —  1)  +  brri  +  c  =  0. 

This  quadratic  equation  has  two  roots,  mi  and  m2.  Thus,  there  are  three  cases: 
Case  (I).  mi,  m2  real  and  unequal.  In  this  case  two  independent  solutions  are 

yi  =  xmi  and  2/2  =  xm2 . 

Case  (II).  mi,  m2  real  and  equal,  i.e.,  mi  =  m2  =  m.  In  this  case  two  indepen¬ 
dent  solutions  are 

yi  =  xm  and  2/2  =  In  x. 

Case  (HI),  mi  =  a-M/?,  m2  =  a  — if3  are  complex  conjugate  roots.  In  this  case 
two  real,  independent  solutions  are 

2/i  =  xa  sin (/3  In  x)  and  2/2  =  cos(/3  In  x). 
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Particular  Solutions 


The  general  solution  of  the  nonhomogeneous  equation 

y"  +  p(x)y'  +  q(x)y  =  f(x) 


is 

y  =  ci  2/1  (a:)  +  c2y2(x)  +  yp(x), 

where  y\  and  y2  are  independent  solutions  of  the  homogeneous  equation  (when 
f(x)  =  0),  and  yp(x)  is  any  particular  solution  to  the  inhomogeneous  equation. 
For  constant-coefficient  equations  a  particular  solution  can  sometimes  be  found 
by  judiciously  ‘guessing’  the  form  from  the  form  of  f(x);  this  method  is  called 
the  method  of  undetermined  coefficients.  In  all  cases,  however,  there  is  a  general 
formula,  called  the  variation  of  parameters  formula,  which  gives  the  particular 
solution  in  terms  of  the  two  linearly  independent  solutions  y\  and  y2  of  the 
homogeneous  equation.  The  formula,  which  is  derived  in  elementary  texts,  is 
given  by 


yp(x) 


yi(s)f(s) 

W(s) 


ds 


W(s) 


ds 


•> 


where 

W(x)  =  yi{x)y'2(x)  -  y2(x)y[(x) 

is  the  Wronskian. 

There  are  several  introductory  texts  on  differential  equations.  A  classic  text 
is  Boyce  and  DiPrima  (1995),  as  well  as  later  editions.  Birkhoff  and  Rota  (1978) 
and  Kelley  and  Peterson  (2004)  are  two  more  advanced  texts. 
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